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(57) Abstract: The present invention provides a host cell comprising a nucleic acid sequence encoding a recombinant polypeptide 
in which the production of a namrally occurring metalloprotease comprising a sequence provided in SEQ ID NO:l has been reduced 
or inhibited by genetic manipulation. The present invention also relates to methods for enhancing the production of a polypeptide 
from a cell by disrupting the synthesis or activity of the metalloprotease. In particular, the present invention relates to methods 
for enhancing the secretion of recombinant polypeptides from host cells such as, but not limited to, yeast and bacterial cells. The 
metalloprotease is a member of the pitiilysin subfamily of proteases, characterized by comprising the sequence HXXEH (SEQ ID 
NO:l), where X is any amino acid. 
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Methods for increasing tiie production of a recombinant polypeptide from a 
iiost ceil 

Fieid of tiie Invention 

The present invention relates to methods for enhancing the production of a polypeptide 
5 from a cell by disrupting the synthesis or activity of a metal loprotease from the dan ME 
(M16 family). In particular, the present invention relates to methods for enhancing the 
secretion of recombinant polypeptides from host cells such as, but not limited to, yeast 
and bacterial cells. 

Baclcground of the Invention 

10 Cholecystoklnin (CCK) is a vertebrate neuroendocrine peptide hormone that is 

expressed in both gut and brain tissues. The maturation of bioactive CCK peptides 
depends on post-translational tyrosine sulfation, endoproteolytic cleavages, 
exoproteolytic trimmings and carboxyterminal amidation. The endoproteolytic 
processing of the N-terminus varies with CCK-83, -58, -39, -33, -22, -8 and -5 being 

15 identified. Most of the CCK peptides are synthesized after cleavage at a single Arg 
residue, however, CCK-22 requires processing after a single Lys residue. 

Many recombinant polypeptides have been expressed In yeast as a fusion protein to the 
Saccharomyces cerevisiae a-factor prepro-peptide to direct secretion through the 

20 secretory pathway. The best characterized yeast protease is the serine endoprotease, 
Kex2p (Fuller et aL, 1989) which is involved in maturation of the a-mating pheromone 
and of killer toxin (Julius et ah, 1984). Another yeast protease is Ypslp belonging to 
the yapsin family of glycosyl-phophatidylinositol (GPI)-anchored aspartyl proteases, 
which is able to rescue mating deficiency when overexpressed in a kex2 mutant (Egeh 

25 MitanI et aL, 1990). Expression of foreign proteins have shown that Ypslp and Yps2p 
contain endoprotease activity. 

The use of host cells for the expression of recombinant polypeptides has greatly 
simplified the production of large quantities of commercially valuable polypeptides, 

30 which otherwise are obtainable only by purification from their native sources. There is a 
varied selection of expression systems currently available from which to choose for the 
production of any given polypeptide, including eubacterial and eukaryotic hosts. One 
Important factor in the selection of an appropriate expression system is the ability of 
the host cell to produce adequate yields of the polypeptide. However, a problem 

35 frequently encountered is the high level of proteolytic enzymes produced by a given 
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host cell or In the culture medium. Accordingly, there \s a need for further methods 
which enhance the production of a recombinant polypeptide from a host cell. 

Metalloproteases are the most diverse of the four main types of protease, with more 
5 than 30 families identified to date. In these enzymes, a divalent cation, usually zinc, 
activates the water molecule. The zinc metalloproteases can be divided based on the 
zinc binding site into for example Zincins and Inverzincins (Hooper, N.M. 1994). The 
metal Ion is held In place by amino acid llgands, usually three in number. The known 
metal llgands are His, Glu/ Asp or Lys and at least one other residue which may play an 
10 electrophllllc role is required for catalysis,. Of the known metalloproteases, around half 
contain an HEXXH motif> which has been shown in crystallographic studies to form part 
of the metal-binding site. 

A number of proteases dependent on divalent cations for their activity have been 
15 shown to belong to a single family, peptidase M16. Included are insulinase, 

mitochondrial processing protease, pitrilysin, nardilysin and a number of bacterial 
proteins. These proteins do not share many regions of sequence similarity; the most 
noticeable Is in the N-termlnal section. This region includes a conserved histldlne 
followed, two residues later, by a glutamate and another histldlne. In pitrilysin. It has 
20 been shown that this HXXEH motif is involved in enzymatic activity (Becker et al. 

1992); the two histidlnes bind zinc and the glutamate is necessary for catalytic activity. 
The X can be any amino acid. Non active members of this family have lost from one to 
three of these active site residues. 

25 It has previously been suggested that one could provide host cells and methods of 

producing proteins by expressing significantly reduced levels of a genetical modification 
In order to express significantly reduced levels of a metalloprotease containing an 
HEXXH motif in a filamentous fungal host cell, in e.g. US 5,861,280 (WO 98/12300). 

30 Others have provided a protease deficient filamentous fungus which Is characterised in 
that the filamentous fungus contains a site selected disruption of DNA that results in 
the filamentous fungus having reduced metalloprotease activity and isolated DNA 
sequences encoding a protein having metalloprotease activity, which is obtainable from 
a filamentous fungus (WO 97/46689). Again this metalloprotease contains an HEXXH 

35 motif. 

However, metalloproteases which can be reduced by a genetical modification in order 
to express significantly reduced levels of said metalloprotease in a non-filamentous 
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fungal host cell and other cells containing an motif other than HEXXH have never been 
descnbed. 

Summary of the Invention 

Whilst investigating the role various proteases play In processing proCCK in 
5 recombinant yeast, the present Inventors surprisingly noted that deletion/disruption of 
CYMl enhanced recombinant polypeptide production and secretion. Furthermore, the 
present inventors have found that Cymlp belongs to a family of metalloproteases, the 
activity of which can be down-reguiated to enhance the levels of recombinant 
polypeptide produced from a host celL 

10 

Accordingly, in a first aspect the present invention provides a host cell comprising a 
nucleic acid sequence encoding a recombinant polypeptide in which the production of a 
naturally occurring metalloprotease comprising a sequence provided In SEQ ID NO:l 
has been reduced or inhibited by genetic manipulation. 

15 

The host cell can be any cell which. In its native state, possesses the metalloprotease. 
Accordingly, the host cell can be a eukaryotic or prokaryotic cell. Examples of preferred 
eukaryotic cells include, but are not limited to, mammalian cells, plants cells and fungal 
cells. In a preferred embodiment, the host cell Is a yeast cell. More preferably, the 

20 yeast cell Is selected from, but not limited to, the group consisting of: Saccharomyces 
sp. such as Saccharomyces cerevisiae, Saccharomyces paradoxus, Saccharomyces. 
mikatae, Saccharomyces bayanus, Saccharomyces castellii and Saccharomyces 
kluyveri, Schizosaccharomyces sp. such as Schizosaccharomyces pombe, 
Kluyveromyces lactis, Hansen u la polymorpha^ Pichia pastoris, Pichia methanolica, Pichia 

25 kluyveri, Yarrowla lipolytica, Candida utilis, Candida cacaoi, and Geotrtchum 
fermentans. 

I^ettalloproteases are among the hydrolases in which the nucleophllic attack on a 
peptide bond Is mediated by a water molecule. This is a characteristic shared with 
30 aspartic proteases, but in the metalloproteases a divalent metal cation, usually zinc, 
but sometimes cobolt or manganese, actives the water molecule. The metal Ion is held 
in place by amino acid ligands usually 3 in number, the known metal llgands In 
metalloproteases are His, Glu, Asp or Lys residues. 

35 Metalloproteases can be divided into two broad groups depending on the metal ions 

required for catalysis, and in the literature metalloproteases have been allocated Into at 
least 8 different clans: MA, MB, MC, MD, ME, MF, MG and MH. Thus, illustrating the 
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complex diversity of this group of proteases. The allocation Is based on different 
consensus sequences due to the llgand binding, and thus each fannlly have different 
biological substrates and/or functions. 

5 The metalloproteases which are to be down regulated according to the present 

Invention Is a member of the pitrllysln subfamily (ME) of proteases, characterized by 
comprising the sequence HXXEH (SEQ ID NO:l) where X Is any amino add. Presently 
more than 180 members are annotated in Swissprot to the ME clan. Thus, the most 
preferred embodiment, the metalloprotease comprises a consensus sequence provided 

10 in SEQ ID NO:l. In another preferred embodiment, the metalloprotease comprises a 
consensus sequence provided In SEQ ID NO: 2. Even more preferably, the 
metalloprotease comprises a consensus sequence provided in SEQ ID NO:3. In 
addition, it Is preferred that the metalloprotease comprises SEQ ID NO:l and a 
glutamic acid residue between 70 and 80 amino acids C-terminal of the second His 

15 residue. Further, it Is preferred that the metalloprotease comprises a sequence selected 
from the group consisting of SEQ ID NO:l, SEQ ID NO:2 and SEQ ID NO:3 as well as a 
sequence selected from the group of: 

I) any one of group consisting of SEQ ID NO's 4 to 15, and 

20 11) a sequence which is at least 80% identical to any one of SEQ ID NO's 4 to 

15. 

More preferably, the metalloprotease comprises SEQ ID NO:l or SEQ ID NO:2 or SEQ 
ID NO: 3 as well as a sequence selected from the group of: 

25 

i) any one of SEQ ID NO's 4 or 5, and 

II) a sequence which is at least 80% Identical to any one of SEQ ID NO's 4 or 5. 

Preferably, the metalloprotease comprises a sequence which is at least 85% Identical, 
30 such as at least 90% identical, such as at least 95% identical, and such as at least 99% 
identical to any one of SEQ ID NO's 4 to 15. 

In a particularly preferred embodiment, the metalloprotease comprises a sequence as 
provided in SEQ ID NO:4, or a sequence at least 80% Identical, such as at least 90%, 
35 such as at least 95% and such as 99% identical, thereto. 



The host cell can be genetically manipulated by any means known in the art as long as 
the production of the metalloprotease Is reduced or inhibited when compared to a 
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parental host cell which has not been genetically manipulated. Such means of 
genetically manipulating the host cell include, but are not limited to; gene knockout, 
gene disruption, random or site directed mutagenesis, introduction of dominant- 
negative metalloproteases, RNA interference (RNAi) using dsRNA, catalytic nucleic adds 
5 (such as ribozymes and DNAzymes), and antisense nucleic acids. Preferably, the 
genetic manipulation acts directly upon the gene encoding the metalloprotease, the 
mRNA transcribed from the gene, or produces a protein that alters the activity of the 
metalloprotease such as a dominant negative mutant which competes with the 
metalloprotease for binding to a substrate but does not, for example, possess catalytic 
10 activity. However, the host cell may be genetically manipulated such that It indirectly 
affects the production or activity of the metalloprotease. For instance, the genetic 
manipulation can target a transcription factor involved in transcribing the mRNA 
encoded by the metalloprotease gene, thus at least reducing the levels of 
metalloprotease produced by the manipulated host cell. 

15 

Furthermore, the host cell may be further genetically manipulated such that it lacks at 
least one other naturally occurring protease of the host cell or has reduced activity for 
at least one other naturally occurring protease of the host cell. The protease can be any 
enzyme of which the Inhibition increases the production of a recombinant polypeptide 
20 produced by the host cell. The protease can either be an endopeptidase, an 

aminopeptidase or a carboxypeptldase. Preferred proteases Include serine proteases, 
aspartyl proteases, cysteine proteases and other metalloproteases. 

In one embodiment, the host cell is a yeast cell and the other naturally occurring 
25 protease(s) is at least one protease encoded by any of the protease genes selected 
from the group consisting of; KEX2, YPSl (previously known as YAP3), YPS2 
(previously known as |V|KC7), YPS3, YPS6, YPS7, BARl, STE13, KEXl, PRCl, PEP4 (also 
known as PRAl), APEl, APE2, APE3 and CPSl. Preferably, the host cell Is a yeast cell 
and KEX2 production has been disrupted. Similar naturally occurring protease(s) within 
other host cells than yeast In addition to the metalloprotease specifically described here 
in could also be disrupted and/or genetically manipulated for an further additive 
enhancement. 

The recombinant polypeptide can be any desired polypeptide which Is capable of being 
produced In the host cell. The recombinant polypeptide can comprise a naturally 
occurring sequence or has been produced by the intervention of man (e.g. a mutant or 
truncation of a naturally occurring protein, or a fusion between at least two different 
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polypeptides). Typically, the recombinant polypeptide will be of connnnercial value, for 
example in the treatment of diseases. 

The recombinant polypeptide can be any size. Typically, the recombinant polypeptide 
5 will range in size from about 30 amino acids to about 4,500 amino acids. In one 
embodiment, the recombinant polypeptide is between about 30 to about 200 amino 
acids in length. 

In at least some host cell expression systems for producing recombinant polypeptides, 
10 it Is desirable to direct the recombinant polypeptide to be secreted from the host cell. 
Thus, In a preferred embodiment, the nucleic acid comprises a sequence which encodes 
a signal for directing the recombinant polypeptide to be secreted from the host cell. 
Preferably, the signal Is an N-terminal hydrophobic signal sequence. Such N-terminal 
hydrophobic signal sequences are known in the art, and include, for example but not 
15 limited to, the leader sequence originating from the fungal amyloglucosidase (AG) gene 
such as galA - both 18 and 24 amino acid versions e.g. from Aspergillus sp., the a- 
factor gene such as yeasts e.g. from Saccharomyces sp. and Kluyveromyces sp., the P- 
factor of Schizosaccharomyces sp., and the a-amylase gene from Bacillus sp. In one 
embodiment, the recombinant polypeptide Is expressed as a fusion of an N-termlnal 
20 hydrophobic signal sequence and a second polypeptide sequence encoding the 

recombinant polypeptide which is from a different source than the signal sequence. 

The nucleic acid encoding the recombinant polypeptide can be provided to the host cell 
using any technique l<nown in the art. In one embodiment, the nucleic acid is inserted 

25 into the genome of the host cell using, for example, homologous recombination based 
techniques. In another embodiment, the nucleic acid is transfected or transformed Into 
the host cell in an expression vector which remains extrachromosomal. For example, 
the expression vector can be a plasmid or a virus. Further, it is preferred that the 
vector comprises a selectable marker which can be used to selectively propagate host 

30 cells comprising the vector. Such selectable markers and the use thereof are also 
known in the art. 

In a second aspect, the present invention provides a method of producing a 
recombinant polypeptide, the method comprising culturing a host cell according to the 
35 second aspect under suitable conditions such that the recombinant polypeptide is 
produced, and recovering the recombinant polypeptide. 
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Since the proteolytic action arising from the metalloprotease has been reduced or 
Inhibited, the method of the second aspect of the invention Improves the stability of the 
recombinant polypeptide produced by the host cell. 

5 In a preferred embodiment, the recombinant polypeptide is secreted from the host cell. 
Furthermore, it is preferred that the secreted protein Is recovered during exponential 
growth of a culture comprising the host cell. 

Preferably, the quantity of the recovered recombinant polypeptide Is higher than if a 
10 parental host cell was used. More preferably, the quantity of the recovered recombinant 
polypeptide is at 50% higher than if a parental host cell was used. 

In a third aspect, the present invention provides a method of cleaving a polypeptide at 
a basic residue, the method comprising contacting the polypeptide. In the presence of a 
15 divalent cation, with a metalloprotease comprising a sequence selected from the group 
of: 

I) any one of SEQ ID NO's 4 to 15, and 

ii) a sequence which is at least 80% Identical to any one of SEQ ID NO's 4 to 
15. 

20 

In a partlculariy preferred embodiment, the metalloprotease comprises a sequence as 
provided in SEQ ID NO:4, or a sequence at least 80% Identical, such as at least 90%, 
such as at least 95% and such as 99% identical, thereto. 

25 Preferably, the metalloprotease cleaves the polypeptide at the C~terminal side of an 
amino acid, or sequence of amino acids, selected from the group consisting of; Lys, 
Arg, ArgArg, LysLys, ArgLys and LysArg. Accordingly, It is preferred that the 
polypeptide comprises Lys, Arg, ArgArg, LysLys, ArgLys or LysArg. Other sequence 
requirements may also be necessary for cleavage, however, these can readily be 

30 determined by routine experimentation. 

Preferably, the divalent cation Is selected from the group consisting of: Zn^*, Co^"*" and 
Mn^"**. 

35 The method of the third aspect can be performed in vivo, within a recombinant host cell 
producing the metalloprotease, or in vitro In suitable reaction conditions. Considering 
the present disclosure, the skilled addressee could readily perform the method of the 
third aspect. An example of an in vitro system for cleaving a polypeptide with the 
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defined metalloprotease Is provided herein. In this Instance, the polypeptide is 
contacted with the metalloprotease provided In a crude yeast cell extract in 0.1 M 
NaH2P04 (pH 4.5) and in the presence of 1 ml^ Mn^* and 1 mM bestatin. In another 
example, the metalloprotease can be recombinantly produced as a fusion protein with a 
5 suitable "'tag", such as a HIs-tag, which enables easy purification of the fusion protein. 
Preferably, such a '"tag" is removed (for example by enzymatic cleavage) before the 
metalloprotease is exposed to the substrate polypeptide. 

In a fourth aspect, the present invention provides a method of Identifying an agent that 
10 inhibits the activity of a metalloprotease comprising a sequence provided In SEQ ID 
NO: 1, the method comprising the steps of: 

a) incubating the metalloprotease with the agent, in the presence of a divalent 
cation and a suitable substrate; 

15 b) determining the activity of the metalloprotease on the substrate; 

c) comparing the activity obtained in step b) with the activity of a control 
sample that has not been incubated with the agent; and 

20 d) selecting an agent that inhibits the activity of the metalloprotease. 

The substrate can be any polypeptide that can be cleaved by the metalloprotease and 
the cleavage event detected. One example disclosed herein is the use of CCK as a 
substrate, where the cleavage event Is detected by the production of CCK-22. Similar 
25 assays can readily be developed for other substrates. 

In a preferred embodiment of the fourth aspect, the metalloprotease comprises a 
sequence as provided in SEQ ID NO:4, or a sequence at least 80% identical, such as at 
least 90%, such as at least 95% and such as 99% Identical, thereto. 

30 

In a fifth aspect, the present invention provides a method of producing a recombinant 
polypeptide, the method comprising culturing a host cell comprising a nucleic acid 
sequence encoding a recombinant polypeptide under suitable conditions such that the 
recombinant polypeptide is produced, and recovering the recombinant polypeptide 
35 wherein said culturing comprises the presence of an inhibitor of a metalloprotease 
comprising a sequence provided in SEQ ID NO:l. 

Preferably, the inhibitor is identified according to a method of the fourth aspect. 
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As will be apparent, preferred features and characteristics of one aspect of tlie 
invention may be applicable to other aspects of the invention. 

5 Throughout this specification the word "comprise", or variations such as "comprises" or 
"comprising", will be understood to imply the inclusion of a stated element, integer or 
step, or group of elements, integers or steps, but not the exclusion of any other 
element. Integer or step, or group of elements. Integers or steps. 

10 The invention will hereinafter be described by way of the following non-limiting Figures 
and Examples. 

Brief Description of tiie Accompanying Drawings 

Figure 1. Cholecystokinin expression construct. Preprol^falp-proCCK fusion protein 
15 with the amino acid sequences around the fusion site and of the primary cleavage sites 
shown. The major forms of secreted CCK with their N- and C-terminal amino acid 
residues are shown below. 

Figure 2. CCK-22 maturation in cells and media as a function of cell growth. BJ2168 
20 expressing preproMfalp-proCCK fusion protein. The CCK-22 immuno-reactivity was 
measured by RIA using Ab 89009 and total CCK content measured with Ab 89009 after 
tryptic cleavage. Open circles represent the fraction of secreted CCK-22, whereas the 
intracellular fraction of CCK-22 is presented as filled triangles. The cell growth was 
measured by ODgooCopen squares). The data represent mean of two independent 
25 experiments. 

Figure 3. Chromatographic analyses of normal and K->A mutated CCK secreted from 
BJ2168. Media from yeast transformed with pRS426 preproMfalp-proCCK and pRS426 
preproMfalp-proCCK (K-^A) were subjected to G-50 gel chromatography and the CCK 
30 immuno-reactivity was measured with Ab 7270 specific for Gly extended CCK (A and C) 
and Ab 89009, which is specific for the N-terminus of CCK-22 (B and D). 

Figure 4. In vitro protease assay including inhibitors and activators. The fraction of 
CCK-22 was calculated from the immuno-reactivity using Ab 89009 divided by the total 
35 amount of mature and N-termina! extended CCK-22 measured with Ab 89009 after 
trypsin treatment. A, Effect of different inhibitors. B, Protease reactivation by addition 
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of 1.2 mM divalent metal ions to extracts where the activity had been Inhibited with 1 
ml^ EDTA. The data represent mean dk SD of three Independent experiments. 

Figure 5. Protease reactivation by Zn^"" and l^ln^"*". In vitro protease assays performed 
5 with cell extracts from UY123, where the activity was inhibited with 1 mM EDTA (filled 
squares) and reactivated by addition of 1.2 mM Mn^"*" (open circles) or 1.2 mM Zn^^ 
(filled circles). The activity was measured as the fraction of matured CCK-22 after 30, 
60 and 120 min incubation. The fraction of CCK-22 was calculated from the Immuno- 
reactivlty using Ab 89009 divided by the total amount of mature and N-termlnal 
10 extended CCK-22 measured with Ab 89009 after trypsin treatment. The data are 
represented by mean ± SD {n=3). 

Figure 6. Extracellular CCK-22 maturation by members of the yapsin family. The 
ability of intact cells to process extracellular CCK was analysed as described under 

15 ''Experimental Procedures" for BY4705 and the isogenic /psi, ypsl yps3 and ypsl yps2 
yps3 strains. The fraction of CCK-22 was calculated from the immuno-reactivity using 
Ab 89009 divided by the total amount of total CCK measured with Ab 89009 after 
trypsin treatment. The data are represented by mean ± SD (n=4). Statistics were 
performed using unpaired t test as described in experimental procedures = 

20 P<0.001, = P<0.01 and * = P<0.05). 

Figure 7. Increased proteolysis following Cymlp overexpresslon. In vitro protease 
assays performed with cell extracts from BJ2168 transformed with an empty pRS425 
plasmid (A) and with pRS425 containing cmi (B). The CCK-22 Immuno-reactlvlty was 
25 measured over time using Ab 89009 (filled squares and circles) and the total amount of 
mature and N-terminal extended CCK-22 measured with Ab 89009 after trypsin 
treatment (open squares and circles). The data represent mean ± SD of three 
independent experiments. 

30 Figure 8. Effects of KEX2 and cmi deletions on proCCK secretion and CCK-22 
maturation. Yeast cells transformed with the proCCK expression construct were 
harvested during exponential phase and the media collected. The Intracellular (A) and 
extracellular (B) amount of total CCK was measured with Ab 89009 after trypsin 
treatment. The fraction of Intracellular (C) and secreted (D) CCK-22 was calculated as 

35 the Immuno-reactivity measured with Ab 89009 before tryptic cleavage divided with the 
total amount of CCK measured in (A) and (B). The i<ex2, cyml and i<ex2 cyml strains 
are Isogenic to BJ2168. The data are given as mean ± SD (n=4). Statistics were 
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performed using unpaired t test (*** = P<0.001, ** = P<0.01 and * = P<0.05). The 
stars enclosed in brackets are a comparison between the kex2 and kex2 cyml strain. 

Figure 9. Intracellular degradation of CCK depends on Cymlp cleavage to CCK-22. 
5 Expression of wild type CCK, preproMfalp-proCCK, and the CCK mutant, preproMfalp- 
proCCK (K->-A) in BJ2168 and a CYMl disrupted strain isogenic to BJ2168. The cells 
were sedlmented during exponential growth and the total amount of CCK (hatched 
bars) was measured after trypsin and carboxypeptldase B treatment with Ab 7270 
specific for Gly-extended CCK. The amount of mature Gly-extended (white bars), which 
10 is dependent on translocation Into the secretory pathway, Kex2p and carboxypeptldase 
activity Is measured as the Immuno-reactlvlty using Ab 7270 before tryptic cleavage 
and carboxypeptldase B treatment. The data are given as mean ± SD (n=3). 

Figure lO. Aspartyl proteases involved in the maturation of CCK-22. Expression of wild 
15 type proCCK in BY4705 and the isogenic yapsin deletion strains of YPSl, YPS2 and 

YPS3. The intra- (A) and extracellular (B) fraction of synthesised CCK-22 was measured 
during exponential growth. The fraction of mature CCK-22 was calculated as the 
immuno-reactivlty measured with Ab 89009 before tryptic cleavage divided with the 
total amount of CCK. The data are represented by mean ± SD (n=3). Statistics were 
20 performed using unpaired t test as described in experimental procedures (*** = 
P<0.001, ** = P<0.01 and >k = P<0.05). 



Figure 11. Cymlp processing C-termlnally to both Lys and Arg residues. CYMl 
deletion enhance the amount of secreted CCK more than two fold of both wild type CCK 
25 and the Lys" -> Arg" mutant. Expression of wild type CCK, preproMfalp-proCCK, and 
the CCK mutant, preproMfalp-proCCK (Lys" Arg") in BJ2168 and a CYMl disruptant 
isogenic to BJ2168. Yeast cells were harvested during exponential phase and the media 
collected. The intracellular (A) and extracellular (B) amount of total CCK was measured 
with Ab 89009 after trypsin treatment. The data are given as mean ± SD (n=3). 



30 



Figure 12. Secreted proCCK fragments Identified by mass spectrometry. The CCK- 
numbers refer to C-terminal amidated CCK. The molecular masses are given as 
monolsotopic values except for * which denote average value. Strain A, vacuolar 
protease-deficient strain (BJ2168), and B, the isogenic strain with KEXl KEX2 
35 disruptions (LJY22). 
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Figure 13. Model for the production of the C-termlnally extended CCK (A) and GLP2 
(B). Expression of these fusion peptides should be perfomned in a secSl mutant, or the 
pre-sequence of the a-mating factor should be removed to avoid translocation Into the 
ER. The amino acid sequences around the fusion sites are shown. Underlined are the N- 
5 and C-terminal amino acids of the Gly-extended CCK-22 and GLPl. 

Figure 14 

A. The preproMfalp-proBNP expression construct. 

10 

B. The preproMfalp-KREAEA-BNP-32 expression construct. 

C. The preproMfalp-KR-BNP-32 expression construct. 
15 Figure 15 

A. The preproMfalp- proBNP expression construct transformed in BJ2168, UY430 
{cyml mutant), UY431 (ypsl mutant) and UY432 {cyml ypsl double mutant). Media 
was analysed from cells that have reached stationary phase using Ab 98192 that is 

20 specific for the N-terminus of proBIMP. The cyml, ypsl, and cymlypsl strains are 

Isogenic to BJ2168. The data are given as mean ± SD (n=3). Statistics were performed 
using unpaired t test as described in experimental procedures = P<0.001, = 
P<0.01 and * = P<0.05). The stars enclosed in bracl<ets are comparisons are between 
the wild type strain, BJ2168 vs. cyml, B32168 vs. ypsl and ypsl vs. ypslcyml. 

25 

(ns, = not significant). 

B. Analysis of proCCK fragments secreted from a cyml mutant. Media containg 10 
pmole proBNP was applied to Superdex 200 column on a Akta purifier system. The 

30 proBNP contant In the collected fractions were measured using Ab. 98192, that Is 
specific for the N-termlnus of proBNP. 

Key to the Sequence Listing 

SEQ ID NO:l - Consensus sequence for pitrllysin proteases. 
35 SEQ ID NO: 2 - Consensus sequence for at least some pitrllysin proteases. 
SEQ ID NO:3 - Consensus sequence for at least some pitrllysin proteases. 
SEQ ID NO:4 - Saccharomyces cerevisiae Cymlp (Swissprot Accession No. P32898). 
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SEQ ID NO: 5 - Schizosaccharomyces pombe CI 19.7 (Swissprot Accession No. 
042908). 

SEQ ID NO: 6 - Clostridium perfringens HypA protein (Swissprot Accession No. 
Q46205). 

5 SEQ ID NO:7 - Borrelia burgdorferi protein BB0228 (Swissprot Accession No. 051246). 
SEQ ID NO:8 - Caenorhabditis elegans C05D11.1 protein (Swissprot Accession No. 
P48053). 

SEQ ID NO:9 - E. coll protease III (Swissprot Accession No. P05458). 
SEQ ID NO: 10 - Rat NRD convertase (Swissprot Accession No. P47245). 
10 SEQ ID NO: 11 - Human insulysin (Swissprot Accession No. P14735). 

SEQ ID NO: 12 - Arabldops/s thallana CPE (Genbank Accession No. T03302). 

SEQ ID NO: 13 - Hunnan metalloprotease I (GenBank Accession No. AAH01150) In part, 

the full sequence (Swissprot Accession No. 095204). 

SEQ ID NO: 14 - Bacillus subtllis zinc protease ymxG (GenBank Accession No. Q04805). 
15 SEQ ID NO: 15 - Mycobacterium tuberculosis zinc protease Rv2782c (GenBank 

Accession No. 033324). 

SEQ ID NO'S 16 to 42 - Oligonucleotides. 

SEQ ID NO'S 43 to 52 - Sequences provided in Figure 12. 

SEQ ID NO'S 53 to 55 - Sequences provided in Figure 1. 
20 SEQ ID NO'S 56 to 65 - Oligonucleotides. 

SEQ ID NO:66 - Consensus sequence for at least some pitrllysin proteases. 

SEQ ID NO:67 - Consensus sequence for at least some pitrilysin proteases. 

SEQ ID NO:68 - Consensus sequence for at least some pitrllysin proteases. 



25 Detailed Description of tiie Invention 

The present invention provides a host cell useful for the expression of a polypeptide, 
said cell being genetically manipulated in order to at least produce reduced levels of a 
defined metalloprotease, when compared to the parental cell. The host cell will thus be 
able to express a protein of interest In higher quantity due to the proteolytic action of 
30 the metalloprotease has been reduced or Inhibited which improves the stability of the 
protein of interest. 

By the method of the invention, the proteolytic action of the metalloprotease has been 
reduced or inhibited, thereby improving the stability of the product obtained. 

35 

Thus, one embodiment of the present invention relates to a host cell useful for the 
expression of a protein of interest, wherein said cell has been genetically modified in 
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order to express significantly reduced levels of a metalloprotease comprising a HXXEH 
motif (SEQ ID NO 1) compared to the corresponding non-modified cell when cultured 
under Identical conditions. 

5 The metalloproteases which are to be down regulated according to the present 

invention do not share many regions of sequence similarity; the most noticeable is in 
the IM-terminal section. This region Includes a conserved histldlne followed two residues 
later by a glutamate and another histldlne. In pitrllysin. It has been shown that this 
HXXEH motif Is Involved in enzymatic activity; the two histidlnes bind zinc and the 
10 glutamate is necessary for catalytic activity. Non active members of this family have 
lost from one to three of these active site residues. 

The metalloprotease family which are to be down regulated according to the present 
Invention is presently classified as member of clan ME, family M16. This family is 
15 currently divided Into 4 subfamilies: 

M16A comprising pitrllysin 

M16B comprising mitochondrial processing peptidase beta-subunit {Saccharomyces 
cerevisiae) 

20 M16C comprising eupltrilysin {Homo sapiens) 

M44 comprising vaccinia virus-type metallolndopeptidase (vaccinia virus). 

Sequence alignments of these proteins show several sequence similarities. These 
sequence similarities are highly conserved and can be used to distinguish members of 
25 this family from non-members. 

Among such sequence similarities several Individual amino acids are highly conserved 
and are easily recognisable in specific positions navigated from the HXXEH motif. 

30 Thus, one embodiment of the present Invention relates to a host cell, wherein the 

metalloprotease comprises a glutamic acid residue between 70 and 80 amino acids C- 
terminal of the second His residue In the HXXEH motif. 



35 



A further embodiment of the present invention relates to a host cell, wherein the 
metalloprotease comprises a glyslne residue 3 amino acids N-termlnal of the first His 
residue in the HXXEH motif. 
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Another embodiment of the present invention relates to a host cell, wherein the 
metalloprotease comprises a glyslne residue 5 amino acids C-terminal of the second His 
residue in the HXXEH motif. 

5 One further embodiment of the present invention relates to a host cell wherein the 
metalloprotease comprises a lysine residue 8 amino acids C-terminai of the second His 
residue in the HXXEH motif. 

Also, one embodiment of the present Invention relates to a host cell, wherein the 
10 metalloprotease comprises a tyrosine residue 9 amino acids C-termlnal of the second 
His residue in the HXXEH motif. 

Furthermore, the present invention relates to a host cell, wherein the metalloprotease 
comprises a proline residue 10 amino acids C-termlnal of the second His residue in the 
15 HXXEH motif. 

Among the sequence similarities several regions of amino acids are also highly 
conserved and are easily recognised. Thus, in a presently preferred embodiment the 
invention relates to a host cell wherein the metalloprotease comprises the consensus 
20 sequence SEQ ID NO 2. 

In another presently preferred embodiment, the invention relates to a host cell, 
wherein the metalloprotease comprises the consensus sequence SEQ ID NO 3. 

25 In a presently most preferred embodiment, the invention relates to a host cell, wherein 
the metalloprotease comprises a NAXTXXXXT motif between 20 and 30 amino acids C- 
terminal of the second His residue in the HXXEH motif. 

In a presently another preferred embodiment, the Invention relates to a host cell, 
30 wherein the metalloprotease comprises the consensus sequence SEQ ID NO 66-68.0ne 
embodiment of the present invention relates to a host cell useful for the expression of a 
protein of interest, wherein said cell has been genetically modified In order to express 
significantly reduced levels of a metalloprotease which is at least 80% identical to the 
any of SEQ ID NO: 4-15, as compared to a parental cell. 

35 

In the present context, the term "protein of interest" relates to any of the numerous 
naturally native occurring extremely complex substances such as but not limited to 
proteins, enzymes and/or antibodies that consist of amino acid residues joined by 
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peptide bonds. It Is an object of preferred embodiments of the present invention to 
provide such native proteins which are products of the host cell itself and/or 
heterologous proteins, fusion proteins, recombinant proteins, eul<aryotic proteins, 
prol<aryotic proteins, lysosomal proteins, vacuolar proteins, precursor proteins, 
5 zymogene proteins, prepro-protelns, and secreted proteins. 

Preferrred embodiments of the claimed method are advantageous due to the higher 
production of the protein of Interest, thus any Increase of the amount of the protein of 
interest when produced In a host cell modified as described herein compared to the 
10 amount produced in the corresponding non-modified cell when cultured under Identical 
conditions are within the scope of the present Invention. 

One assay in which a skilled addressee could evaluate enhanced production of the 
protein of interest in a host cell modified as described here in and compared to the 
amount produced in the corresponding non-modified cell, is by culturing the two 
different host cells under identical condition, and measure the amount of produced 
protein of Interest by radio-Immune assay using an antibody specific for the protein of 
interest. One such assay Is describe in more detail in the examples of the present 
description. 

One embodiment of the present Invention relates to a host cell, wherein the total 
amount of the protein of interest Is increased at least 5% compared to the 
corresponding non-modified cell when cultured under identical conditions, such as at 
least 10% compared to the corresponding non-modified ceil when cultured under 
identical conditions, such as at least 20% compared to the corresponding non-modified 
cell when cuitured under Identical conditions, such as at least 50% compared to the 
corresponding non-modlfled cell when cultured under identical conditions, such as at 
least 100% compared to the corresponding non-modified cell when cultured under 
identical conditions, such as at least 200% compared to the corresponding non- 
modlfled cell when cultured under identical conditions, or even at least 1000% or 
compared to the corresponding non-modlfled cell when cultured under Identical 
conditions. 

In the present context, the term "host cell" relates to any cell capable of producing the 
35 protein of interest. Thus, in one prefered embodiment, the host Is a prokaryotic cell. In 
another preferred embodiment, the host cell is a eukaryotic ceil, such as but not limited 
to a filamentous fungal cell and a non-filamentous fungal cell. Non limiting examples 
hereof are a strain of Saccharomyces, especially Saccharomyces cerevisiae. 



15 



20 



25 



30 
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All the features described herein relating to the methods of the present invention are 
also applicable as embodiments relating to the host cells, and vice versa. 

5 The method described in the present application relates to the production of a protein 
of interest in a host cell, wherein said host cell has been genetically modified in order to 
express significantly reduced levels of a metalloprotease which Is at least 80% identical 
to the SEQ ID NO: 4 as compared to a parental cell, when cultured under Identical 
conditions, comprising 

10 

a) Introducing Into the host cell a nucleic add sequence encoding the protein of 
interest; 

b) cultivating the host cell of step (a) In a suitable growth medium for 
15 production of the protein of interest and 

c) isolating the protein of Interest. 

One embodiment of the present Invention relates to a method for the production of a 
20 protein of interest in a host cell, wherein the host cell has been genetically modified by 
a method selected from the group comprising gene knock-out, gene disruption, random 
or site directed mutagenesis, introduction of dominant-negative metalloproteases, RNA 
interference (RNAi) using dsRNA, catalytic nucleic acids (such as ribozymes and 
DNAzymes), antlsense nucleic acids or a combination thereof. 

25 

In a presently most preferred embodiment, the host cell is essentially free of any 
metalloprotease activity. 

One preferred embodiment of the present Invention relates to a method for the 
30 production of a protein of interest In a host cell, in which the protein of interest Is a 
eukaryotic protein, selected from the group comprising Insulin, growth hormone, 
glucagon, somatostatin, Interferon, adrenocorticotropic hormones, angiotensinogen, 
atrial natriuretic peptides, dynorphin, endorphlnes, galanin, gastrin, gastrin releasing 
peptides, neuropeptide Y fragments, pancreastatin, pancreatic polypeptides, secretin, 
35 vasoactive intestinal peptide, growth hormone releasing factor, melanocyte stimulating 
hormone, neurotensin, adrenal peptide, parathyroid hormone and related peptides, 
somatostatin and related peptides, brain natriuretic peptide, calcitonin, corticotropin 
releasing factor (CRF), cocaine amphetamine regulated transcript (CART), thymosin. 
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urotensin, glucagon and glucagon like peptides (GLP-1 and GLP-2), somatostatin, 
interferon, a vascular endothelial growth factor (VEGF), platelet derived growth factor 
(PDGF), factor VII, factor VIII, factor V, factor IX, interleukins, urokinase, 
erythropoietin (EPO), chymosin, tissue plasminogen activator. Positive cofactor 2 
5 glutamine/Q-rich-associated protein (PCAP), peptide tyrosine tyrosine (PYY), ghrelin, 
orexin, Beta-neoendorphin-dynorphin precursor, CCK or serum albumin. 

Another preferred embodiment of the present Invention relates to a method for the 
production of a protein of Interest in a host cell, in which the protein of Interest is a 
10 protein of fungal origin, selected from the group comprising an amylolytic enzyme, an 
alpha-amylase, a beta-amylase, a glyco-amylase, a alpha-galactosidase, a cellulytic 
enzyme, a lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an 
oxidoreductase, a peroxidase, a laccase, a pectlnase, or a cutinase. 

15 A further preferred embodiment of the present invention relates to a method for the 
production of a protein of interest in a host ceil, in which the protein of interest is a 
bacterial protein, selected from the group comprising an amylolytic enzyme, an alpha- 
amylase, a beta-amylase, a glyco-amylase, a beta-galactosidase, a cellulytic enzyme, a 
lipolytic enzyme, a xylanolytic enzyme, a proteolytic enzyme, an oxidoreductase, a 

20 peroxidase, a laccase, a pectlnase, or a cutinase. 

A special embodiment of the present Invention relates to a method for production of a 
protein of interest in a host cell, in which the protein of interest Is a precursor. I.e. a 
zymogen, a hybrid protein, a protein obtained as a pro sequence or pre-pro sequence, 
25 or in unmaturated form. 

General Molecular Biology 

Unless otherwise Indicated, the recombinant DNA techniques utilised in the present 
30 invention are standard procedures, well known to those skilled in the art. Such 

techniques are described and explained throughout the literature in sources such as, 3. 

Perbal, A Practical Guide to Molecular Cloning, John Wiley and Sons (1984), J. 

Sambrook et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbour 

Laboratory Press (1989), T.A. Brown (editor). Essential Molecular Biology: A Practical 
35 Approach, Volumes 1 and 2, IRL Press (1991), D.M. Glover and B.D. Hames (editors), 

DNA Cloning: A Practical Approach, Volumes 1-4, IRL Press (1995 and 1996), F.M. 

Ausubei et al. (Editors), Current Protocols In Molecular Biology, Greene Pub. Associates 
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and Wiley-Interscience (1988, including all updates until present). Methods in 
Enzymology. Vol 194. Guide to Yeast Genetics and Molecular Biology. (1991) Ed 
Gunthrie and Finl< Academic Press, Methods in l^icrobiology Vol. 26. Yeast Gene 
Analysis. (1998) Ed. Brown and Tuite. Academic Press, Miller, 3. H. (1992) A Short 
Course in Bacterial Genetics (Manual, L., ed). Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY, Johnston, J. R (1994) l^olecular Genetics of Yeast (A Practical 
Approach) Oxford University Press, Oxford., and Molecular Genetics of Yeast: A 
Practical Approach, Ed. J.R. Johnston, IRL Press (1994) and are incorporated herein by 
reference. 



10 



The % identity of a polypeptide is determined by GAP (Needleman and Wunsch, 1970) 
analysis (GCG program) with a gap creation penalty=5, and a gap extension 
penalty=0.3. The query sequence is at least 15 amino acids In length, and the GAP 
analysis aligns the two sequences over a region of at least 15 amino acids. More 

15 preferably, the query sequence is at least 50 amino acids in length, and the GAP 

analysis aligns the two sequences over a region of at least 50 amino acids. Even more 
preferably, the query sequence Is at least 100 amino acids in length and the GAP 
analysis aligns the two sequences over a region of at least 100 amino acids. More 
preferably, the query sequence Is at least 250 amino acids in length and the GAP 

20 analysis aligns the two sequences over a region of at least 250 amino acids. Even more 
preferably, the query sequence is at least 500 amino acids In length and the GAP 
analysis aligns the two sequences over a region of at least 500 amino acids. 



Pitrilysin Subfamily of Metalloproteases 



25 



The pitrilysin subfamily of metalloproteases is characterized by the presence of a 
HXXEH (SEQ ID NO:l) motif. A general review of this subfamily Is provided by Rawlings 
and Barrett (1995). Members of this family include, but are not limited to, S. ceKvisiae 
Cymlp (SEQ ID iMO:4) (Swissprot Accession No. P32898), Schizosacchammyces pombe 

30 CI 19.7 (SEQ ID NO: 5) (Swissprot Accession No. O42908), Clostridium perfringens 
HypA protein (SEQ ID NO: 6) (Swissprot Accession No. Q46205), Borreiia burgdorferi 
protein BB0228 (SEQ ID NO:7) (Swissprot Accession No. 051246), Caenortiabditis 
elegans C05D11.1 protein (SEQ ID NO: 8) (Swissprot Accession No. P48053), E. coli 
protease III (also known as pitrilysin) (SEQ ID NO:9) (Swissprot Accession No. 

35 P05458), rat NRD convertase (SEQ ID NO: 10) (Swissprot Accession No. P47245), 
human insulysin (SEQ ID NO: 11) (Swissprot Accession No. P14735), Arabidopsfs 
tfiaiiana CPE (SEQ ID NO: 12) (Genbanl< Accession No. T03302), human 
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metalloprotease I (in part) (SEQ ID IMO:13) (GenBank Accession No. AAH01150) (the 
full sequence: Swissprot Accession No. O95204), Bacillus subtilis zinc protease ymxG 
(SEQ ID NO: 14) (GenBank Accession No. Q04805), and Mycobacterium tuberculosis 
zinc protease Rv2782c (SEQ ID NO: 15) (GenBank Accession No. 033324). For E. coll 
5 protease III (SEQ ID NO:9) it has been shown that the His residues of SEQ ID NO:l, as 
well as Glu-169, are involved In divalent cation binding whilst the Giu residue flanked 
by the His residues is a catalytic residue. 

A gene encoding a pitrilysin metalloprotease can readily be identified by screening by 
10 hybridization for nucleic acid sequences coding for all of, or part of, the 

nnetalloprotease, e.g. by using synthetic oligonucleotide probes, that may be prepared 
on the basis of a cDNA sequence, e.g. the nucleotide sequences encoding any one of 
the metalloproteases presented as SEQ ID NO's: 4 to 15, in accordance with standard 
techniques. 

15 

Genetic Manipulations 



The host cell of the invention which Is genetically manipulated in order to produce 
reduced levels of the defined metalloprotease may be modified using standard 
20 recombinant DNA technology known to the person skilled in the art. The gene sequence 
responsible for the production of the metalloprotease may be Inactivated or eliminated 
entirely. 

In a particular embodiment, the host ceil of the invention is one genetically 
25 manipulated at the coding or regulatory regions of the metalloprotease gene. Known 
and useful techniques include, but are not limited to, gene knockout, gene disruption, 
random or site directed mutagenesis. Introduction of dominant-negative 
metalloproteases, RNA interference (RNAI) using dsRNA, catalytic nucleic acids (such as 
ribozymes and DNAzymes), and antisense nucleic acids, or a combination thereof. 

30 

Mutagenesis may be performed using a suitable physical or chemical mutagenlzing 
agent. Examples of a physical or chemical mutagenlzing agent suitable for the present 
purpose includes ultraviolet (UV) irradiation, hydroxylamine, N-methyl-N'-nltro-N- 
nitrosoguanidine (MNNG), O-methyl hydroxylamine, nitrous acid, ethyl methane 
35 sulphonate (EMS), sodium bisulfite, formic acid, and nucleotide analogues. When such 
agents are used, the mutagenesis Is typically performed by incubating the cell to be 
mutagenized In the presence of the mutagenlzing agent of choice under suitable 
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conditions for the mutagenesis to tal<e place, and selecting for mutated cells having a 
significantly reduced production of metalloprotease. 

Genetic manipulation may also be accomplished by the introduction, substitution or 
5 removal of one or more nucleotides in the metalloprotease coding sequence or a 
regulatory element required for the transcription or translation thereof. Nucleotides 
may, for example, be inserted or removed so as to result in the introduction of a stop 
codon, the removal of the start codon or a change of the open reading frame. The 
modification or Inactivation of the structural sequence or a regulatory element may be 
10 accomplished by site-directed mutagenesis or PCR generated mutagenesis in 
accordance with methods i<nown In the art. 

A convenient way to Inactivate or reduce the metalloprotease production of a host cell 
Is based on the principles of gene interruption. This method involves the use of a DNA 

15 sequence corresponding to the endogenous gene or gene fragment which It is desired 
to destroy. The DNA sequence is In vitro mutated to a defective gene and transformed 
into the host cell. By homologous recombination, the defective gene replaces the 
endogenous gene or gene fragment. It may be desirable that the defective gene or 
gene fragment encodes a marker which may be used for selection of transformants in 

20 which gene encoding the metalloprotease has been modified or destroyed. 

The term "antisense" as used herein refers to nucleotide sequences which are 
complementary to a specific nucleic acid sequence. Antisense molecules may be 
produced by any method, including synthesis by iigating the gene(s) of Interest in a 
25 reverse orientation to a viral promoter which permits the synthesis of a complementary 
strand. Once introduced into a host cell, this transcribed strand combines with natural 
sequences, in this instance that encoding the metalloprotease, produced by the cell to 
form duplexes. These duplexes then biocl< either the further transcription or translation. 
In this manner, mutant phenotypes may be generated. 

30 

The term "catalytic nucleic acid" refers to a DNA molecule or DNA-contalnIng molecule 
(also known in the art as a "deoxyrlbozyme") or an RNA or RNA-contalning molecule 
(also known as a "ribozyme") which specifically recognizes a distinct substrate and 
catalyzes the chemical modification of this substrate. The nucleic acid bases In the 
35 catalytic nucleic acid can be bases A, C, G, T and U, as well as derivatives thereof. 
Derivatives of these bases are well known in the art. 



wo 2004/027067 



22 



'CT/DK2003/000609 



Typically, the catalytic nucleic acid contains an antisense sequence for specific 
recognition of a target nucleic acid, and a nucleic acid cleaving enzymatic activity, also 
referred to herein as the "catalytic domain". The types of ribozymes that are 
particularly useful In this Invention are the hammerhead ribozyme (Haseloff and 
5 Gerlach, 1988) and the hairpin ribozyme (Shippy et al., 1999). 

Ribozymes useful for the methods of the Invention, and DNA encoding the ribozymes, 
can be chemically synthesized using methods well known In the art. The ribozymes can 
also be prepared from a DNA molecule (that upon transcription yields an RNA molecule) 

10 operably linked to an RNA polymerase promoter, e.g., the promoter for T7 RNA 

polymerase or SP6 RNA polymerase. When the vector also contains an RNA polymerase 
promoter operably linked to the DNA molecule, the ribozyme can be produced in vitro 
upon incubation with RNA polymerase and nucleotides. In a separate embodiment, the 
DNA can be inserted into an expression cassette or transcription cassette. After 

15 synthesis, the RNA molecule can be modified by ligation to a DNA molecule having the 
ability to stabilize the ribozyme and make it resistant to RNase. Alternatively, the 
ribozyme can be modified to the phosphothio analog for use In liposome delivery 
systems. This modification also renders the ribozyme resistant to endonuclease activity. 

20 dsRNA (RNAi) is partriculaHy useful for specifically inhibiting the production of a 
pari:icular protein. This technology relies on the presence of dsRNA molecules that 
contain a sequence that is essentially identical to the mRNA of the gene of interest, in 
this case a mRNA encoding the metailoprotease. Conveniently, the dsRNA is produced 
In a single open reading frame in a recombinant vector or host cell, where the sense 

25 and anti-sense sequences are flanked by an unrelated sequence which enables the 
sense and anti-sense sequences to hybridize to form the dsRNA molecule with the 
unrelated sequence forming a loop structure. The design and production of suitable 
dsRNA molecules for genetic manipulation Is well known within the capacity of a person 
skilled in the art:, part:iculariy considering Waterhouse et al. (1998), Elbashlr et al. 

30 (2001), WO 99/32619, WO 99/53050, WO 99/49029, and WO 01/34815. 

Owing to the genetic manipulation, the host cell of the invention expresses significantly 
reduced levels of the metailoprotease. In a preferred embodiment, the level of 
metailoprotease expressed by the host cell is reduced more than about 25%, such as 
35 more than about 30%, such as more than about 35%, such as more than about 40%, 
such as more than about 45%, such as more than about 50%, such as more than 
about 55%, such as more than about 60%, such as more than about 65%, such as 
more than about 70%, such as more than about 75%, such as more than about 80%, 
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such as more than about 85%, such as more than about 90%, such as more than 
about 95%, such as more than about 98%, and such as more than about 99%. 

In a presently most preferred embodiment, the product expressed by the host cell Is 
5 essentially free of any activity of the defined metalloprotease. 

In the present context, the term "essentially free" relates to a host, wherein the 
metalloprotease expressed by said host cell is reduced to a level, where the function of 
said metalloprotease has no biologically significant reducing influence on the production 
10 of the protein of Interest. 

Protein of Interest 

The terms "polypeptide", "protein" and "peptide" are used herein interchangeably and 
15 in the present context relates to any of the numerous naturally occurring extremely 
complex substances such as but not limited to enzymes or antibodies that consist of 
amino acid residues joined by peptide bonds, contain the elements carbon, hydrogen, 
nitrogen, oxygen, usually sulphur, and occasionally other elements such as but not 
limited to phosphorus or Iron, that are essential constituents of all living cells, that are 
20 in nature synthesised from raw materials by plants but assimilated as separate amino 
acids by animals, that are both acidic and basic and usually colloidal in nature although 
many have been crystallised, and that are hydroiyzable by acids, alkalies, proteolytic 
enzymes, and putrefactive bacteria to polypeptides, to simpler peptides, and ultimately 
to alpha-amino acids. 

25 

As defined herein, a "recombinant polypeptide" Is a protein which is not native to the 
host cell, or a native polypeptide In which modifications have been made to alter the 
native sequence, or a native protein whose expression Is quantitatively altered as a 
result of a manipulation of a native regulatory sequence required for the expression of 
30 the native protein, such as a promoter, a ribosome binding site, etc., or other 
manipulation of the host cell by recombinant DNA techniques. 

Owing to the absence or reduction in activity of the defined metalloprotease, at least a 
portion of the recombinant polypeptides expressed by the host cell may also be a 
35 precursor protein. I.e. a zymogen, a hybrid protein, a protein obtained as a pro 
sequence or pre-pro sequence, or In unmaturated form. 
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In a more specific embodiment, the recombinant polypeptide is of eukaryotic origin, 
sucli as Insulin, adrenocorticotropic hormones, angiotenslnogen, atrial natriuretic 
peptides, dynorphin, endorphines, galanin, gastrin, gastrin releasing peptides, 
neuropeptide Y fragments, pancreastatln, pancreatic polypeptides, secretin, vasoactive 
5 Intestinal peptide, growth hormone releasing factor, melanocyte stimulating hormone, 
neurotensin, adrenal peptide, parathyroid hormone and related peptides, somatostatin 
and related peptides, brain natriuretic peptide, calcitonin, corticotropin releasing factor 
(CRF), cocaine amphetamine regulated transcript (CART), thymosin, urotensin, 
glucagon and glucagon like peptides (GLP-1 and GLP-2), somatostatin. Interferon, a 
10 vascular endothelial growth factor (VEGF), platelet derived growth factor (PDGF), factor 
VII, factor WII, factor V, factor IX, interleukins, urokinase, erythropoietin (EPO), 
chymosin, tissue plasminogen activator, CCK or serum albumin. 

With specific regard to "glucagon and glucagon like peptides", this term as used herein 
15 may refer to polypeptides of human origin or from other animals and recombinant or 
semisynthetic sources and include all members of the glucagon family, such as GRPP 
(glicentine related polypeptide), glucagon, GLP-1 (glucagon like peptide 1), and GLP-2 
(glucagon like peptide 2), Including truncated and/or N-termlnally extended forms, 
such as GLP-l(7-36), and includes analogues, such as GLP-1(7-35)R36A GLP-2 F22Y, 
20 GLP-2 A19T+34Y. GLP2 A2G and GLP-2 A19T, and other analogues having from 1 to 3 
amino acid changes, additions and/or deletions. 

Host Cells and the Expression of Recombinant Polypeptides Therefrom 

25 The host cells for use in the present invention can be prokaryotic or eukaryotic. The 
eukaryotic host cells for use in the present Invention can be, for example, fungal, 
mammalian, plant or Insect cells. Preferably, the host cells are yeast cells. 

In order to produce the desired polypeptide, the host cell of the invention comprises a 
30 nucleic acid sequence encoding the recombinant polypeptide as well as regulatory 
sequences for directing the expression of the desired product such as regions 
comprising nucleotide sequences necessary or e.g. transcription, translation and 
termination. The genetic design of the host cell of the invention may be accomplished 
by the person skilled in the art, using standard recombinant DNA technology for the 
35 transformation or transfection of a host cell. 
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Preferably, the host cell is modified by methods known In the art for the Introduction of 
an appropriate expression cassette in, for example a plasmid or a viral vector, 
comprising the nucleic acid encoding the recombinant polypeptide. The expression 
cassette may be introduced into the host cell by a number of techniques including, but 
5 not limited to, as an autonomously replicating plasmid or integrated into the 
chromosome. 

Expression cassettes may contain regulatory sequences such as transcription control 
sequences, translation control sequences, origins of replication, and other regulatory 

10 sequences that are compatible with the recombinant cell and that control the 
expression of nucleic acid molecules encoding the recombinant polypeptide. In 
particular, recombinant nucleic acid molecules of the present invention Include 
transcription control sequences. Transcription control sequences are sequences which 
control the Initiation, elongation, and termination of transcription. Particularly important 

15 transcription control sequences are those which control transcription Initiation, such as 
promoter, enhancer, operator and repressor sequences. Suitable transcription control 
sequences include any transcription control sequence that can function In at least one 
of the recombinant cells of the present Invention. A variety of such transcription control 
sequences are known to those skilled In the art. Preferred transcription control 

20 sequences include those which function In bacterial, yeast, arthropod and mammalian 
cells, such as, but not limited to, tac, lac, trp, trc, oxy-pro, omp/Ipp, rrnB, 
bacteriophage lambda, bacteriophage T7, T7lac, bacteriophage T3, bacteriophage SP6, 
bacteriophage SPOl, metallothlonein, alpha-mating factor, Pichia alcohol oxidase, 
alphavirus subgenomic promoters (such as Sindbis virus subgenomic promoters), 

25 antibiotic resistance gene, baculovirus, Heliothls zea insect virus, vaccinia virus, 

herpesvirus, raccoon poxvirus, other poxvirus, adenovirus, cytomegalovirus (such as 
Intermediate early promoters), simian virus 40, retrovirus, actin, retroviral long 
terminal repeat, Rous sarcoma virus, heat shock, phosphate and nitrate transcription 
control sequences as well as other sequences capable of controlling gene expression In 

30 prokaryotic or eukaryotic cells. Transcription control sequences of the present Invention 
are most preferably naturally occurring transcription control sequences associated with 
yeast. Suitable promoters for S. cerevisiae include the MFal promoter, galactose 
inducible promoters such as GALl, GAL7 and GALIO promoters, glycolytic enzyme 
promoters including TPIl and PGKl promoters, TRPl promoter, CYCI promoter, CUPl 

35 promoter, PHOS promoter, ADHl promoter, and HSP promoter. A suitable promoter In 
the genus Pichia Is the AOXI (methanol utilisation) promoter. 
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Recombinant polypeptides of the present invention may also (a) contain secretory 
signals to enable an expressed polypeptide to be secreted from the cell that produces 
the polypeptide and/or (b) contain fusion sequences which lead to the expression of 
fusion proteins. Examples of suitable signal segments include any signal segment 
5 capable of directing the secretion of the fusion protein. Preferred signal segments 
include, but are not limited to, tissue plasminogen activator (t-PA), Interferon, 
Interleukin, growth hormone, histocompatibility and viral envelope glycoprotein signal 
segments, the leader sequence originating from the fungal amyloglucosldase (AG) gene 
such as gaiA - both 18 and 24 amino acid versions e.g. from Aspergillus sp., the a- 
10 factor gene of yeasts e.g. from Saccharomyces sp. and Kluyveromyces sp., the P-factor 
of Schizosaccharomyces sp., and the a-amylase gene from Bacillus sp., as well as 
natural signal sequences. 

The cloning vehicle may also comprise a selectable marker, e.g. a gene, the product of 
15 which complements a defect In the host cell, or one which confers antibiotic resistance, 
such as amplclllln, kanamycin, chloramphenicol or tetracycline resistance. 

The procedures used to llgate the DNA construct of the invention, the promoter, 
terminator and other elements, respectively, and to Insert them Into suitable cloning 
vehicles containing the information necessary for replication, are well known to persons 
skilled in the art. 



20 



Recombinant DNA technologies can be used to improve the expression of transformed 
nucleic acid molecules by manipulating, for example, the number of copies of the 

25 nucleic acid molecules within a host cell, the efficiency with which those nucleic acid 
molecules are transcribed, the efficiency with which the resultant transcripts are 
translated, and the efficiency of post-translatlonal modifications. Recombinant 
techniques useful for increasing the expression of nucleic add molecules useful for the 
methods of the present Invention Include, but are not limited to, operably linking the 

30 nucleic acid molecule to high-copy number plasmlds, integration of the nucleic acid 
molecule into one or more host cell chromosomes, addition of vector stability 
sequences to plasmids, substitutions or modifications of transcription control signals 
(e.g., promoters, operators, enhancers), substitutions or modifications of translational 
control signals (e.g., ribosome binding sites, Shine-Dalgarno sequences), modification 

35 of nucleic add molecule to correspond to the codon usage of the host cell, and the 
deletion of sequences that destabilize transcripts. The activity of an expressed 
recombinant polypeptide of the present Invention may be Improved by fragmenting, 
modifying, or derivatlzing polynucleotide molecules encoding such a protein. 
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Methods of Producing Recombinant Polypeptides 

Host cells tiiat have been transfected or transformed with the nucleic acid encoding the 
5 recombinant polypeptide are cultured for a sufficient time and under appropriate 

conditions known to those skilled in the art in view of the teachings disclosed herein to 
permit the production, and preferably secretion, of the polypeptide, followed by 
recovery of the desired product. 

10 Furthermore, owing to the reduced activity of the metalloprotease, the recombinant 
polypeptide expressed by the host cell may be obtained as a precursor protein, i.e. a 
zymogen, a hybrid protein, a protein obtained as a pro sequence or pre-pro sequence, 
or In unmaturated form. 

15 The broth or medium used for culturing may be any conventional medium suitable for 
growing the host cell in question, and may be composed according to the principles of 
the prior art. The medium preferably contains carbon and nitrogen sources and other 
inorganic salts. Suitable media, e.g. minimal or complex media, are available from 
commercial suppliers, or may be prepared according to published protocols. 

20 

With regard to yeast host cells, it is often advantageous to produce heterologous 
polypeptides in a diploid yeast culture, because possible genetical defects may become 
phenotypically expressed In a haplold yeast culture, e.g. during continuous 
fermentation In production scale, and because the yield may be higher. The production 
25 of recombinant polypeptides In yeast host cell Is described In Molecular Genetics of 
Yeast: A Practical Approach, Ed. J.R. Johnston, IRL Press (1994) which is Incorporated 
herein by reference. 

After cultivation, the protein is recovered by conventional methods for isolation and 
30 purification proteins from a culture broth. Well-known purification procedures include 
separating the cells from the medium by centrifugatlon or filtration, precipitating 
proteinaceous components of the medium by means of a salt such as ammonium 
sulphate, and chromatographic methods such as e.g. Ion exchange chromatography, 
gel filtration chromatography, affinity chromatography, etc. 

35 

The present invention is exemplified by demonstrating that the total amount of CCK 
and proBNP is increased about 60% about 100%, respectively when compared to the 
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non-modified host cell. The examples thus demonstrate that the host cells of the 
present Invention are able to Increase the production of diverse proteins. 

Further, the examples disclose that additional disruption of other proteases enhance 
5 the production of the protein of interest, for example disruption of both KEX2 and CYMl 
results in an additive effect in the yield in the production of CCK (nearly 100%). 

Further, it will be understood by the skilled addressee that special amino acids within 
some of the motifs described, particularly His, Glu, Asp or Lys, are essential to the 
10 function of the metalloprotease by functioning as metal llgands. 

Furthermore, it will be recognised that the metalloproteases of the present Invention 
have been annotated widely in the literature as family members of the pitrllysin family, 
insulysin family, insulinase family^ inverzlncin family and M16 subfamily of cfan ME. 

15 

Thus, it will be understood that any feature and/or aspect discussed above In 
connection with any of these different family annotations apply by analogy to the 
metalloprotease described herein, which ail include the HXXEH motif. 

20 However, please note that pitrilysin (without family) In Itself refers to a specific 
member of clan ME of metalloproteases In E. co//. 

Examples 

Materials and Methods 

Yeast strains and growth conditions 

25 

The yeast strains used are listed in Table I. Construction of strains were carried out 
using either the two step gene disruption technique (Rothstein, 1991) or the PGR based 
method by (Brachmann et a!., 1998). Media were purchased from Difco, amino acids 
and other supplements from Sigma-Aldrlch. Yeast cells were grown at 30**C In YPD (1% 
yeast extract, 2% peptone and 2% dextrose) or synthetic complete media (SC) based 
on yeast nitrogen base with ammonium sulfate, succinic add, NaOH and appropriate 
amino acids. Transformations with either linear DNA or plasmlds were performed using 
the modified lithium acetate procedure as described (Gietz et al., 1995). Analysis of 
heterologous expressed CCK was performed from yeast growing in exponential phase 
due to the consistency in CCK-22 biosynthesis, in contrast to the results from yeast 
within the stationary phase (Fig. 2). ProCCK processing was analysed from cell extract 
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and media of 5 Agoo units of cells per ml synthetic complete media. Cell growth was 
followed by the absorbance at 600 nM. 

Table 1. S. cerevisiae strains used In this study. Null mutants of putative 
5 metalloproteases are named by the ORF in the genotype and (*) represents 
mitochondrial proteases. 
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Strain 

BY4705 

UY13 

UY14 

UY15 

BJ2168 

UY21 

UY22 

UY23 

UY122 

UY123 

UY201 

UY202 

UY203 

UY204 

Y15298 
Y11874 
Y10148 (*) 
Y14953 
Y16224 (*) 
Y14984 (*) 
Y17144 
Y13211 
Y13801 



Genotype 

MATa ade2A::hisG h!s3A200 leuZAO lys2A0 ura3A0 

MATa ade2A::hisG his3A200 leuZAO lys2A0 uraSAO 
ypsl::TRPl 

MATa ade2A::hfsG hfs3A200 leu2A0 lys2A0 ura3A0 
ypsl::TRPl yps3::LEU2 

MATa ade2A::hisG hls3A200 leu2A0 lys2A0 ura3A0 

ypsl::TRPl yps3::LEU2 yps2::URA3 

MATa prcl-407 prbl'1122 pep4-3 feu2 trpl ura3'52 gal2 

MATa prcl'407 prbl-1122 pep4'3 teu2 trpl ura3-52 gal2 

kexl::LEU2 

MATa prcl'-407 prbl'1122 pep4-3 Ieu2 trpl ura3-52 gal2 

kexl::LEU2 kex2::TRPl 

MATa prcl-407 prbl-1122 pep4-3 Ieu2 trpl ura3''52 gal2 
kex2::TRPl 

MATa apelr.KANMX ape2::LYS2 his3A0 leu2A0 lys2A0 
ura3A0 

MATa apelr.KANMX ape2::LYS2 ape3::LEU2 his3A0 leu2A0 
lys2A0 ura3A0 

MATa prcl'407 prbl-1122 pep4'3 Ieu2 trpl ura3''52 gal2 
axll::LEU2 

MATa prcl-407 prbl'1122 pep4-3 Ieu2 trpl ura3-52 gal2 
ste24::LEU2 

MATa prcl-407 prbl'1122 pep4-3 Ieu2 trpl ura3-S2 gal2 
prdl:LEU2 

MATa prcl'407 prbl'1122 pep4'3 !eu2 trpl ura3S2 gal2 
yiil08w::LEU2 

MATa hls3Al teu2A0 lys2A0 ura3A0 ste23::KANMX 
MATahis3Al leu2A0 lys2AO ura3AQ aapl::KANMX 
MATahis3Al leu2A0 lys2A0 ura3A0 afg3::KANMX 
MATahls3Al leu2A0 lys2A0 ura3A0 apel::KANMX 
MATa his3Al leu2A0 lys2A0 ura3A0 rcaliiKANMX 
MATahis3Al leu2A0 lys2A0 ura3A0 mipl::KANMX 
MATahls3Al leu2A0 lys2A0 ura3A0 ymel::KANMX 
MATa his3Al leuZAO lys2A0 ura3A0 ybr074w::KANMX 
MATah/s3Al leu2A0 lys2A0 ura3A0 ydll04c::KANMX 



Source 

(Brachmann et 
al., 1998) 

This study 

This study 

This study 
(Jones, 1991) 
This study 

This study 

This study 

This study 

This study 

This study 

This study 

This study 

This study 

Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
Euroscarf 
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Y11941 


MATa hisJAl leu2A0 lysZAO ura3A0 yhrll3w::KANMX 


Euroscarf 


Y11960 


MATahis3Al leuZAO lysZAO uraSAO yhrl32c::KANMX 


Euroscarf 


Y12296 


MATah!s3Al leuZAO lysZAO ura3A0 yill37c::KANMX 


Euroscarf 


Y15370 


MATahls3Al leu2A0 lys2A0 ura3A0 ynl045w::KANMX 


Euroscarf 




MATa ura3'A851 /eu2-Al hfs3A200 lys2A202 




ykr035C'ykr038c: :URA3 


Euroscarf 


Y11749 


MATa his3Al feu2A0 lysZAO ura3A0 yol057w::KANMX 


Euroscarf 


Y16248 


MATa his3Al leu2A0 lys2A0 ura3A0 yol098c::KANMX 


Euroscarf 


10231B 


MATahis3A200 leu2Al trplA63 ura3'52 




yoll54w(4, 744): :KANMX 


Euroscarf 


Y14266 


MATa his3Al leu2A0 lys2A0 ura3A0 ydr430c: : KANMX 


Euroscarf 


UY430 


MATa prcl'407 prbl-1122 pep4-3 Ieu2 trpl ura3-52 ga/2 


This study 


ydr430c::LEU2 


UY432 


MATa prcl-407 prbl-1122 pep4-3 Ieu2 trpl ura3-52 gal2 


This study 


kex2::TRPl ydr430c::LEU2 
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DNA extrac±ion and amplification 

Yeast genomic DNA was Isolated as described (Phlijppsen et al., 1991). Polymerase 
cliain reaction (PGR) was performed using eitiier Pwo polymerase or the enzyme 
5 cocktail based on Taq, Pwo and Pfu polymerase (Expand long range PGR kit, XL-PGR) 
both from Roche. All PGR products were visualised by agarose gel-electrophoresls and 
PGR products either purified from the gel using the gel-extraction kit (Qiagen) or from 
the reaction mixture by PGR purification spin columns (GENOMED). PGR based one step 
gene disruption was performed using 50 ng of plasmid from the pRS400 series 
10 (Brachmann et al., 1998) as template. Amplification of the marker was performed with 
oligonucleotides having 20 nucleotides towards the plasmid and additional 50 
nucleotides flanking the target gene (Table 2). All other DNA manipulations were 
carried out according to standard procedures (Sambrook et aL, 1989). 

15 Plasmid constructions 

Expression of proGGK was performed in pRS426 [2}i URA3'\ (Brachmann et al., 1998) 
using the phosphoglycerate kinase promoter {PGKlp) and terminator {PGKlt). The 
PGKl promoter was amplified with PGKlp S'Hindlll and PGKlp yMCS (Table 2) using 

20 100 ng of genomic yeast DNA as template and subsequently cloned into pGEM-11 
(Promega) in the HfndlU and Sad restriction enzyme sites. The terminator was 
amplified with PGKlt S'Bgfll and PGKlt 3'SacI (Table 2) and ligated into the plasmid 
containing the promoter at the Sad and EcoRl restriction enzyme sites. This construct, 
pGEM-11 PGKlpMCSPGKlt then contained the PG/Ci -promoter, a multiple cloning site 

25 (MCS) with the restriction enzyme sites EcoRl, BamHl, Xbal and Bglll followed by the 
PGKl terminator. The preproMfalp-proCGK fusion (Rourke et al., 1997) (Fig. 1) was 
subcloned into the EcoRl and Xbal sites of pGEM-11 PGKlpMCSPGKlt and finally the 
entire gene was cloned into pRS423 as well as pRS426 to complete the yeast GCK 
expression constructs, pRS423 preproMfalp-proGGK and pRS426 preproMfalp-proGGK 

30 respectively. Expression of CYMl on a multi copy plasmid was constructed by 

amplification of the open reading frame (ORF) of CYMl and additional 926 bp at the 5' 
end and 703 bp at the 3' end. The amplification was carried out by XL-PGR using 100 
ng of genomic yeast DNA and the oligonucleotides, CYMl 5'Apal and 



35 



Table 2. Oligonucleotides used. 
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Oligo 



Oligonucleotide sequence (5'-3') 



PGKlpS'Hindill 
PGKlpS'MCS 

PGKltS'Bglll 

PGKltySacl 

5'MFal-Eco/?I 

CCK3'-XteI 

CCK-22 K->A (S) 

CCK-22 K-^A (AS) 

CCK-22 K-^R (S) 

CCK-22 K-^R (AS) 
CCK-22 seq 
CYMlS'Apal 

CYM13'Xhol 

KEX2S' 

KEX23' 

KEX2DCS' 
KEX2DC3' 

TRPlS'Ndel 
TRP13'Avrll 



AATAGAAGCTTGTCGACTGATCTATCCAAAACTG 
(SEQ ID NO: 16) 

AAAAGAGCrCGGCCAGATCTTCTAGAGGATCCAA 
GAATTCTGTTTTATATTTGTTGTAAAAAGTAG 
(SEQ ID NO: 17) 

TTTTGAATTCCAAGATCrCCCATGTCTCTACTGGTGG 
(SEQ ID NO: 18) 

CCCCGAGCTCGTCGACCCTTCTCGAAAGCTTTAACGAAC 
GC (SEQ ID NO: 19) 

TTTTGAATTCAAAGAATGAGATTTCCTTCAATTTTTACTG 
CAG (SEQ ID NO: 20) 

I I i M CTAGACTAGGAGGGGTACTCATACTCCTCGGC 
(SEQ ID NO: 21) 

CGAATGTCCATCGTTSCSAACCTGCAGAACCTG 
(SEQ ID NO: 22) 

CAGGTTCTGCAGGTT£CrAACGATGGACATrCG 
(SEQ ID NO: 23) 

CGAATGTCCATCGTTASSAACCTGCAGAACCTG 
(SEQ ID NO: 24) 

CAGGTTCTGCAGGTTCCIAACGATGGACATTCG 
(SEQ ID NO: 25) 

TCGCAGAGAACGGATGGC (SEQ ID NO: 26) 

TTTTGGGCCCTTCATGGTGATACGGTATCrCTTGGC 
(SEQ ID NO: 27) 

TTTTCTCGAGAAGGTGGAACATACTGCCCTGGGATGG 
(SEQ ID NO: 28) 

TTTTGAGCTCGTrTAGGAAACGTCCTTGGCGGAGATGC 
(SEQ ID NO: 29) 

I I I I ICTAGACACTGCGAATCCATGGTATAAACCAAAACC 
(SEQ ID NO: 30) 

GTCGTTGTTCATGGACATACCTCC (SEQ ID NO: 31) 
TACAAATGTTCTTCTGCCATTTCTGG (SEQ ID NO: 32) 

GGTTCATATGCGCCGGAGCrCCTCGACAGCAG 
(SEQ ID NO: 33) 

GGTTCCTAGGATCCGCAAGTTTGATTCCATTGCGGTG 



Purpose 



Expression 
construct 

Expression 

construct 

Expression 

construct 

Expression 

construct 

preproMfalp-proC 

CK 

preproMfalp-proC 
CK 

Lys^'^AIa^^ 

mutation 

Lys"->Ala^^ 

mutation 

Lys^i-^Arg^^ 

mutation 

Lys^i-^Arg®^ 

mutation 

Sequencing 
Cloning of CYMl 

Cloning of CYMl 

Cloning of KEX2 

Cloning of KEX2 

Control of A/fex2 
Control of A/rex2 

Cloning of TRPl 
Cloning of TRPl 



wo 2004/027067 



PCT/DK2003/000609 



/CEXI5'GD400 

KEX13'GD400 

KEX15*DC400 
KEXiyDC400 

Cmi 5*GD400 

CyWJ3'GD400 

Cmi5*DC400 
Cmi3'DC400 



34 



(SEQ ID NO: 34) 

TTAAAGAGTACCTTGGCTATAGAATACCGTAGAGATAAA 

GACCTGAATAGAGATTGTACTGAGAGTGCAC 

(SEQ ID NO: 35) 

AGGTATTATAACTATrrrrCTGT AI Mil lA TAT At I 11 iA T 

TTGCCAAGCTGTGCGGTATTTCACACCG 

(SEQ ID NO: 36) 

CTTTGGTTAAAGAGTACCTTGGC (SEQ ID NO: 37) 
TACTACGAAAAGCGTGTGCGAGG (SEQ ID NO: 38) 
TAGAAGGCTACTCAAAAGAATAAAGTrACTATAAAATATA 
CTGCGGTATATAGATTGTACTGAGAGTGCAC 
(SEQ ID NO: 39) 

GATCGGCTWVGAAACTTTGAAGCAGTATATTTACAGGATT 

AAATTATATATCTGTGCGGTATTTCACACCG 

(SEQ ID NO: 40) 

CGGAGGGGCTCTATGATAAAGG (SEQ ID NO: 41) 
GAGTAACTAGGGCTTCTCTTCCC (SEQ ID NO: 42) 



KEXl deletion 

KEXl deletion 

Control of Akexl 
Control of Akexl 

CYMl deletion 

CYMl deletion 

Control of Acyml 
Control of Acyml 



CYMl 3'Xhol (Table 2). The PGR product was purified on spin columns and 
subsequently cloned into the Apal and Xhol restriction enzyme sites of pRS425. 

5 

The Lys^^ residue, believed to be crucial for the proteolysis of proCCK to release 
CCK-22, was exchanged by Ala by site-directed mutagenesis (Norton et al., 1993). The 
exchange was performed by PCR using the Pwo polymerase (Boehringer Mannheim), 
where two products were amplified with the oligonucleotides sets, PGKlpS' Hindlll / 

10 CCK-22 K-^A (antisense) and CCK-22 K->A (sense) / PGKltS' Sad (Table 2) and 50 ng 
of pRS426 preproMfalp-proCCK as template to each reaction. The two products were 
subjected to agarose gei-electrophoresis and approximately 1 mm^ of each product 
where cut out and used directly as template in a third PCR reaction. In this reaction the 
full-length cDNA encoding the fusion protein was amplified using PGKlpS' Hindlll and 

15 PGKltS' Sad (Table 2). The PCR product was subcloned into pCR-Blunt II (Invitrogen) 
and sequenced with the CCK specific primer, CCK-22 seq. Finally the PGKlp 
preproMfalp-proCCK (K->A) PGKlt product was cloned Into the Hfndlll and Sad sites 
of pRS426 to construct the expression plasmid, proCCK (K^A). Substitution of Lys with 
Arg was performed as described above by exchanging the CCK specific primers with 
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CCK-22 K-»R (antisense) and CCK-22 K->R (sense) to construct the proCCK (K-^R) 
vector. 



10 



Strain construction 

Construction of a partial KEX2 disruption was performed in BJ2168 by amplification of 
tlie entire KEX2 gene with 1000 bp on each site of the ORF by XL-PCR using 100 ng of 
genomic yeast DNA and the oligonucleotides, KEX2 S'Sacl and KEX2 3'Xbal (Table 2). 
The PGR product was purified on spin columns and cloned into pCR-Blunt II 
(Invltrogen). Amplification of TRPl was performed by XL-PCR introducing an Ndel site 
925 bp 5' to the ORF and an Avrll site 212 bp 3' to the stop codon using TRPl 5'Ndel 
and TRPl 3'AvrII (Table 2). The PGR product was purified and subcloned into the Ndel 
and Avrll sites of KEX2 eliminating 2018 bp of KEX2 and 170 bp of the promoter. The 
kex2::TRPl construct was excised from pGR-Blunt II using the Notl and Spel restriction 
15 enzymes and subsequently transformed Into BJ2168. Transformants were selected on 
SG-Trp plates followed by a colony PCR screen to test for correct integration using 
oligonucleotides that cover the entire marker plus an additional 1200 bp on each site of 
KEX2, kex2 DCS' and kex2 DCS' (Table 2). Construction of a kex2 kexl strain was 
performed by the two step gene disruption technique (Rothstein, 1991) using the LEU2 
20 marker. Amplification of LEU2 was performed by XL-PCR using 50 ng of pRS405 as 
template and AcexI5'GD400 and A:exi3'GD400 (Table 2). The PCR product was purified 
using PGR purification spin kit (GENOMED) and subsequently transformed into UY23. 
Transformants were selected on SG-Leu plates and correct integration was tested by 
PCR-based colony screen using ArexI5'DG and kexl3'DC (Table 2). 

25 

A Acyml::LEU2 (UY430) strain in a B32168 background was constructed by the one 
step gene disruption technique as described above for the Akexl strain using the 
oligonucleotides, Cmi5-GD400 and cmi3'GD400 (Table 2) for gene disruption and 
cymlS'DC and cyml3'DC (Table 2) for disruption control. All null mutants created by 

30 this method were prepared with oligonucleotides designed towards the 50 bases 
adjacent to the 5' and 3' UTR with a specific 3' end to the pRS400 series of vectors 
containing various markers (Brachmann et al., 1998). Transformants were selected on 
appropriate agar plates followed by a colony PCR screen to test for correct Integration 
using oligonucleotides that cover the entire marker plus an additional 200 bp on each 

35 site. Only the oligonucleotides that are not positioned as described above are shown In 
Table 2. 
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Gene deletions of STE24, AXLl, PRDl and yjLiOSiv were made in B32168 using the 
PGR disruption teciinlque (Braclimann et al., 1998) and pRS405 [LEU2'\ as tennplate. 

The LJY123, which contain gene deletions of APEl, -2 and -3, was derived from Y14953 
5 using PGR disruption technique (Brachmann et al., 1998). APE2 was initially replaced 
with the LYS2 (pRS317 {cen; LyS2]) where the PGR product was purified from agarose 
gel prior to transformation and APE3 was substituted with the LEU2 marker (pRS405 
ILEU2^). 

10 The ypsl yps2 ypsS triple mutant (UY15) was constructed in BY4705 using the PGR 
disruption technique (Brachmann et al., 1998). The ORF of YPSl were initially deleted 
by insertion of the TRPl locus (pRS404) to generate UY13. This strain was then used 
as host for the deletion of YPS3 by Insertion of the LEU2 marker (pRS405) and finally 
the YPS2 was deleted by insertion of the UF(A3 marker by amplification of pRS406 

15 iURA3'\ to construct UY15 (Table I). 

GGK and CYMl expression 

Human proGGK was expressed as a fusion protein between the prepro leader sequence 
of yeast a-mating factor and proGCK (preproMfalp-proGGK). The fusion construct was 

20 expressed on multi-copy plasmids, with constltutively gene transcription from the 
phosphoglycerate kinase promoter. "'ProGGK expression" refers to expression using 
pRS426 preproMfalp-proGGK, which was used in all yeast strains with exception of 
BY4705 and the Isogenic yapslns deletion strains, where proCGK was expressed from 
pRS423 preproMfalp-proGGK. CYMl expression was driven by its own promoter. 

25 Plasmid constructs, and oligonucleotides used are listed In Table 2. 

Enzymatic treatment 

Trypsin treatment was performed using 1 mg/ml Trypsin (Worthlngton Biochemical 
30 Gorporatlon) in a 50 mM sodium phosphate buffer (pH 7.5) for 30 min at RT and 

terminated by Immersion Into boiling water for 10 min. Garboxypeptidase B (Boehringer 
Mannheim) treatment with a final concentration of 4 \xg/xn\ was performed in 0.1 mM 
sodium phosphate buffer (pH 7.5) at room temperature for 30 min. The reaction was 
terminated by Immersion Into boiling water for 10 min. 
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Gel chromatography 

Yeast transformants grown to late exponential phase were centnfuged at 15000 g to 
5 collect the ceils and 500 ^il of the medium was loaded directly onto a Sephadex G-50 
superfine (Pharmacia) column (1x100 cm) at 4«C. The sample was eluted in VBA buffer 
(20 mM barbital buffer, 0.11% bovine serum albumin and 0.6 mM thiomersal) at a flow 
rate of 3.5 ml/h and fractions were collected every 17 min. Calibrations were performed 
by including "^I-albumin (Vo) and ^^NaCl (V^). The elution constants K^, of peaks eluting 
10 at Ve are calculated as /C^ = {Ve-Vo)/(VrVo). 

Radio-lmmunoassay 

Two different antlsera were used to determine the amount of processed 
15 cholecystokinin. Ab 89009 (Paloheimo et al., 1994) Is specific for the N-terminus of 
CCK-22 and Ab 7270 (Hilsted et al., 1986) Is specific for Gly-extended CCK. The 
fraction of CCK processed to CCK-22 Is calculated by division of the immuno-reactivlty 
measured with Ab 89009 with the amount measured with the same antibody aflier the 
sample was treated with trypsin to measure the total amount of N-terminal extended 
20 CCK-22. 

Yeast extract and protease assay 

Ten Aeoo units of yeast cells growing in exponential phase were sedimented by 
25 centrifugatlon at 3000 g for 5 min, washed once in 25 ml H2O and transferred to a 2 ml 
Eppendorf tube. An equal amount of acid washed glass beads (Sigma-Aldrich) was 
added followed by 200 pi of 0.1 M NaH2P04 (pH 4.5) Including various inhibitors (150 
HM Bestatin, 30 ^M E-64, 10 [xM Leupeptin, 1 Pepstatin A, 1 mM 
phenylmethylsulfonyl fluoride (PMSF), 1 mM EDTA and 1 mM 1,10-orthophenanthrollne 
30 or 1 tablet complete Inhibitor with or without EDTA per 2.5 ml 0.1 M NaH2P04 

(Boehringer Mannheim)). The cells were broken by vortexing 3x20 sec and the extracts 
were clarified by centrifugatlon at 15000 g for 10 min. All steps were carried out at 4«C. 
The protease assay was performed using 20 pmol synthetic amidated CCK-33 
(Peninsula Laboratorle Europe, Merseyslde, England) or Ac-CCK-33-Gly (Cambridge 
35 Research Biochemlcals, Stockton, England) as substrate, 20 >al yeast extract, various 
Inhibitors and activators in a total volume of 30 ^1. The mixture was incubated at 30^C 



wo 2004/027067 PCT/DK2003/000609 

38 

for 1 h and the reaction terminated by adding 500 ii\ VBA buffer followed by immediate 
immersion into a boiling water bath for 10 min. 

Protease assay using metalloprotease deficient strains 

5 

The assay was performed as described above, but with addition of 1 ml^ Bestatin and 1 
ml^ Mn^* to decrease N-terminal degradation. 

Protease assay using intact yeast cells 



10 



Five Asoo units of exponential growing cells where sedimented, washed once in 5 ml H2O 
and once in SC media (pH 6.0), before the cells where resuspended in 25 ^1 SC media. 
The protease assay was performed by addition of 20 pmol synthetic Ac-CCK-33-Gly as 
substrate and the mixture Incubated with gentle shaking at 30»C for 1 h. The reaction 
15 was terminated by addition of 500 td VBA and the cells removed by centrifugatlon 
before the supernatant was immersed into boiling water for 10 min. 

Analysis of secreted CCK by MALDI-TOF 

20 Fifty Aeoo units of CCK transformed yeast cells were subjected to 25 ml of fresh media, 
followed by inoculation for 3 h. Cells were removed by centrifugatlon at 15000 g for 10 
min and 500 mI of media was concentrated and desalted by reverse phase using a 
ZipTIp Ci8 column (l^illipore). The peptides were eiuted with 10 pi 50% acetonitrile. The 
purified peptides were analysed in a l^atrix Assisted Laser Desorption/Ionization time- 

25 of-flight mass spectrometer (BIflex, Bruker-Franzen, Bremen, Germany) operated in 
the reflected mode using time lag focusing (delayed extraction). For analysis, 0.5 pi of 
the sample was mixed with 0.5 M matrix solution (a-cyano-4-hydroxyclnnamic acid in 
acetonltrile/methanol, Hewlett Packard). Then 0.5 pi of the mixture was applied to the 
probe and allowed to dry before Introduction Into the mass spectrometer. 

30 

Statistical analysis 

Statistical calculations were performed using an unpaired students t-testto analyse 
whether the change In proCCK expression or the fraction of mature CCK-22 between 
35 wild type yeast expressing proCCK and mutants isogenic to the wild type strain can be 
considered to be statistically significant. The P-value calculated for CCK-22 processing 
between yapsin mutants are comparisons of BY4705 and each mutant, whereas the 
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brackets represent comparisons between BY4705 ypsl and BY4705 ypsl yps3 and, 
BY4705 ypsl yps3 and BY4705 ypsl yps2 yps3, respectively. 

Expression of proBNP In Saccharomyces cerevfsiae - Construction of the ypsl mutant 
5 Cloning ofpreproBNP 

Messenger RNA was Isolated from a 500 mg Biopsy from human heart using the 
Quickprep Micro mRNA purification Kit (Amersham Pharmacia Biotech). First strand 
cDNA was prepared from 2 \xq mRNA in a reaction containing, 2.5 nl lOx first strand 
10 buffer (Promega), 2.5 ^1 100 mM DTT, 2.5 |il 10 mM dNTP, 2.5 \x\ Na pyrophosphate, 10 
pmol Oligo{dT)i3, 10 units reverse transcriptase, AMV (Promega), and H2O to 25 inl. 
Messenger RNA and Oligo(dT)i8 was heated to 70*^0 for 5 min cooled on ice for 5 min 
prior to cDNA synthesis. The first strand cDNA syntesis was performed at 42^C for 60 
min. 

15 

The cDNA encoding preproBNP was ampified using Pwo polymerase (Roche), 1 \x\ 1. 
strand cDNA, 5 ^il lOx Pwo buffer Included MgCl2 (Roche), 5 ^1 2.5 mM dNTP, 30 pmol 
of each primer {BUPS'EcoRl and BNPS'Xbal). The PGR product encoding preproBNP was 
cloned in pBIuescript II (Stratagene). All subsequent PGR reactions were performed as 
20 described above. 

The fusion between the cDNA's encoding the preprosequence of the a-mating factor 
and proBNP was performed using overhang extension PGR, where two PGR reactions 
were set up. One using 50 ng of pRS426 preproMfalp-proCGK as template, 

25 MFal5'Eco/?I and MFIBNP (AS) as primers and a second with 50 ng of preproBNP 
cloned in pBIuescript and the primers, MFIBNP (S) and BNPS'XZ^al. In the third PGR 
reaction, approximately 50 ng of each PGR products were purified from agarosegel from 
the the two initial PGR using the gel-extraction kit (Qiagen) and used as template with 
the two oligonucleotides, MFal5'Eco/?I and BHP3'Xbal. The preproMfalp-proBNP 

30 encoding construct was subcloned in pGR-Blunt II (Invltrogen) and sequenced with 
vector specific oligonucleotides prior to subcloning Into the EcoRl and Xbal sites of 
pGEM-11 PGKlpMCSPGKlt. Rnally the entire gene was cloned Into pRS426 to complete 
the yeast proBNP expression constructs, pRS426 preproMfalp-proBNP. 

35 Furthermore, two additional constructs have been made, in which the proBNP fragment 
(1-76) has been removed. These constructs are similar to do the preproMfalp-proBNP, 
but do only syntheslse BNP-32. In the first construct, the Kex2p cleavage site and the 
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spacer peptide of the preproMfalp has been sustained (KREAEA)( Figure 14B), whereas 
In the other construct, the spacer peptide has been removed (Figure 14C)- Analysis of 
the BNP-32 expression from wild type yeast and the Isogenic CYMl disruptant will be 
analysed by RIA's using the ShIonoria-BNP system from Electra-Box Diagnostica ApS. 
5 This assay Is specific for BNP-32, 

Expression of proCCK, proBNP and Cymlp 

Human proCCK was expressed as a fusion protein between the prepro leader sequence 
10 of yeast a-mating factor and proCCK (preproMfalp-proCCK). The fusion construct was 
expressed on multi-copy plasmlds, with constltutlvely gene transcription from the 
phosphoglycerate kinase promoter. ''ProCCK expression" refers to expression using 
pRS426 preproMfalp-proCCK, which was used In all yeast strains with exception of 
BY4705 and the isogenic yapslns deletion strains, where proCCK was expressed from 
15 pRS423 preproi^lfalp-proCCK. Human proBNP was also expressed as a fusion protein 
between the prepro leader sequence of yeast a-matIng factor and proBNP 
(preproMfalp-proBNP) (Figure 14A). CYMl expression was driven by its own promoter. 
Plasmid constructs, and oligonucleotides used are listed In Table 2. 

20 BNP radioimmunoassay 

Antibody 98192 is specific for the N-terminus of proBNP (G0tze et aL, 2002). 
Chromatograptiy 

25 FPLC chromatography was performed on a Superdex 200 column on a Akta purifier 

(Amersham Pharmacia Biotech). In the 50 mM Na-phospate buffer, 100 mM NaCI and 6 
M Guanidin were included. 

Strain construction 

30 

AAypsl::TRPl (UY440) and a Acyml::LEU2 6ypsl::TRPl (UY431) strain in a BJ2168 
background were constructed by the one step gene disruption technique as described 
above using the oligonucleotides, yPSI5'GD400 and yPSi3'GD400 (Table 2) for gene 
disruption. The PCR product was transformed into BJ2168 and IJY430. Verification of 
35 the correct Integration of the disruption cassete was analysed by PCR using ypslS'DC 
and ypsiyoc (Table 2). 
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Identification of the gene encoding the Cyml orthologue In PIchia pastorfs or Pfchfa 
methanolica 

Identification of the unknown genes from Pichfa pastorfs and Pichia methanolica 
5 encoding the Cymlp orthologues of Saccharornyces cerevis/ae will be carried out in 
similar manner, using the same set of degenerated primers mentioned below. Pichfa 
pastoris and Pichia methanolica will be refered to as Pichia In the following text. 

By alignment of the orthologous Cyml proteins of Saccharomyces Iciuveri and 

10 Schizosaccharomyces pombe to Cymlp from Saccharomyces cerevisiae, there was 
Identified a number of identical amino acid sequences. From these sequences It Is 
possible to syntesize degenerated oligonucleotides (Table 3) that will bind to the 
complementary DNA strands of CYi^l in all three species, and thus to the CYi^l gene of 
Pfchfa, Amplification of the genomic sequence will initially be carried out by using high 

15 quality genomic DNA as template, Pfchia-CYf^l-la and Pfchfa-CYMl'lh and the Pwo 
polymerase (Roche). The amplified sequence with an expected size of approximately 
525 bp will be cloned In pBlunt or a similar vector and sequenced with vector specific 
primers. If no band appear from the Initial amplification, a second round of PCR will be 
performed with the two nested primers, Pichia-CYi^l-Ua and Pichia-CYi^l-Ub using 1 ^1 

20 of the first PCR reaction as template. The expected product is approximately 270 bp 
and will be cloned in pBlunt and sequenced with M13 foreward and M13 reverse 
primers. From the obtained sequence there will be synthesized sequence specific 
primers, two nested sense and two nested antisense specific primers. Using one of the 
sense primers it is possible to obtain a PCR product with Pichia-CYM l-lllb using high 

25 quality genomic DNA as template. This product of '-2100 bp will be cloned and 

sequenced. If it falls to produce a band of '^'2100 bp, it would be nessessary to isolate 
mRNA from Pichia, produce double stranded cDNA and ligate adaptors to the ends as 
described by the manual to the Clontech Marathon cDNA Amplification Kit ( BP (Becton, 
Dickinson and Cpmpany)). Using the two sequence specific sense primers It is possible 

30 to obtain the 3' end of the mRNA of approximately 2600 bp and the sequence specific 
antisense primers to amplify the 5' end Including the sequence encoding the 
hypothetical active site, HXXEH motif. Synthesis of sequence specific oligonuleotldes 
from the 5' and 3' untranslated region, full-length cDNA encoding the Cyml orthologue 
in Pichia can be cloned. 
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Amino acid sequence 
Ollgo Pichia-CYMl 'la 



K Y P V R D P 
5' AARTAYCCXGTXMGXGAYCC 3' 



Amino acid sequence 
Oligo Pichfa-CYMl-lb 



H P S N A K 
3' GTRGGXWSXTTRCGXTTY 5' 



Amino acid sequence 
10 Oligo Pichia-CYMl-lla 



D P F F K M 
5' GAYCCXTTYTTYAARATG 3' 



Amino acid sequence 
Oligo Pichla-CYMl'Ilb 



G V V Y N E M 
3' CCXCAXCAXATRTTRCTYTAC 5' 



15 Amino acid sequence E K G G A Y G 

Oligo Pichia-CYMl'UIb 3' ctyttyccxccxcgxatrcc 5' 

X-Inosine, degenerated oligonucleotides follow the International Union of Biociiemistry 
fhttp://www.cliem,QmuLac.uk/lubmb/m isc/naseq.htmh . 

20 

Genedisruptlon of Cyml ortliologue in Pichia pastoris or Pichia methanolica 

The sequence encoding the Cyml orthologue in Pichia should be doned in a vector like 
pBluescript-II in Hindlll and Sad or a similar vector, if these are not present in the 

25 ORF. Insertion of the ORF in Hindlll and Sad sites removes most of the multiple 

cloning sites from the vector, which ease the possibility to find restriction enzyme sites 
that are only present In the ORF. Cloning of the Kani^X casette within the ORF, 
preferentially so that 1000 bp of the Pichia CYi^l are present on each site of the Kani^X 
cassette creates the Pichia CYl^l disruption cassette. This construct can then be 

30 amplified by PCR, using primers at specific for the 5' and 3' end of the Pichia CYi^l 
gene. Transformation of the PCR product into strains of Pichia followed by selection of 
transformants on YPD plates containing 100 ^g/ml geneticin (G-418). Verification of the 
correct Integration Into the Pichia genome should be tested by colony PCR, using Pichia 
sequence specific CYl^l primers that binds 5' and 3' to the Kani^X cassette. From the 

35 size of the PCR product it is possible to distinguish whether the integration event Is 
correct. 
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Expression of foreign proteins and peptides In Pichia pastoris 

For expression of peptides one could use the expression vector, pPICa (inducible 
expression) or pGAPZa {constitutive expression) botli from Invitrogen. Both these 
5 vectors use the preprosequence of the a-mating factor from Saccharomyces cerevisiae 
to direct the fusion peptide through the secretory pathway. Within the Golgi apparatus 
the preprosequence of the a-matIng factor Is removed and the peptide of Interest 
released to the media. If it's proteins that should be expressed, both vectors metioned 
above can be used without the preprosequence of the a-matIng factor (pPICZ and 
10 pGAPZ), where the heterologus expressed protein Is cytosollc located and can be 
Isolated from Intact cells. 

Expression of foreign proteins and peptides in Pichia methanol/ca 

15 Expression of proteins and peptides in Pichia methanolica Is performed In a similar 
manner as in Pichia pastoris, where plasmids are avaiable both for /ntracellular 
expression and for secretion to the media. Jnfracellular expressed proteins can be 
cloned into pl^ET (Invitrogen) and for secretion in pl^lETa (Invitrogen). The pI^ETa 
contain the preprosequence of the a-mating factor from Saccharomyces cerevisiae as 

20 used for expression In Pichia pastoris. Expression is induced by methanol in this 
system. 

Results 

The Influence of growth conditions on the CCK-22 processing 

25 

The Intra- and extracellular fraction of CCK-22 was measured from BJ2168 expressing 
proCCK. The Intracellular fraction remained unaltered whether the cells were In 
exponential growth or had reached stationary phase (Fig. 2). However, the relative 
amount of secreted CCK-22 changed dramatically when the cells reached stationary 
30 phase. During exponential growth the fraction of CCK-22 was 23%, but in the 

stationary phase (after 270 min) the fraction increased to 37% (Fig. 2). Hence, for the 
experiments described herein only exponentially growing cells were used. 
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The significance of the Lys residue In the release of CCK-22 

To evaluate the role of the Lys residue In proteolysis, transformants of BJ2168 with the 
5 two expression constructs, proCCK and proCCK (K-^A) were grown to late exponential 
phase and the culture media collected. The media from each strain was subjected to gel 
chromatography and the content of Gly-extended CCK In the collected fractions where 
measured with Ab 7270. CCK from the wild type media etuted in two major peaks at Ka 
= 0.8 and 1.1 (Fig. 3 A) in accordance with the previously established elution positions 

10 for CCK-22-Gly and CCK-8-Gly (Cantor et al., 1987; Rourke et al., 1997), while the 

proCCK (K^A) only gave rise to CCK-8-Gly and a larger form eluting at a /C^ = 0.6 (Fig, 
3 C), The two peaks eluting at == 0.7 and 0.8 for the wt construct (Fig. 3 B) 
correspond to C-terminally extended CCK-22 and CCK-22-Gly, respectively (Rourke et 
al., 1997), whereas no CCK-22 Immuno-reactlvlty was observed In these positions for 

15 the proCCK (K->A) construct (Fig. 3 D). However, a small peak of immuno-reactlvlty 
was seen at = 0.55 which may be due to the sllpht cross reactivity of Ab 89009 with 
a larger unprocessed fragment (Palohelmo et a!., 1994). To investigate the effect of 
substituting Arg for Lys, proCCK (K->-R) was transformed into B32168. Media from 
transformants where analysed before and after tryptic cleavage. The fraction of proCCK 

20 processed to CCK-22 was similar to that seen for wild type CCK (Fig. 11). 

Analysis of secreted CCK peptides by mass spectrometry 

Media collected from BJ2168 transformed with proCCK were analysed by mass 
25 spectrometry (Figure 12). The fragments obtained correspond to the processing leading 
to CCK-39, CCK-22 and CCK-8. Two peptides were identified N-terminal of Lys^^ (Tyr"*^- 
VaP, 1805.0 Da and Tyr^^- Lys^S 1932.2 Da). It appeared likely that the former was a 
carboxypeptldase degradation product of the latter. To elucidate this question and In an 
attempt to Identify the C-termlnal extended CCKs, the present inventors produced a 
30 disruption strain in which both KEX2, encoding the serine protease responsible for the 
processing to CCK-8, and the carboxypeptldase encoded by KEXl were mutated. 
Following transformation of proCCK into this kex2 kexl strain (LJY22) and subsequent 
analysis of the secreted peptides the inventors found only the peak corresponding to 
Tyr'^^-Lys^^. The same pattern, with only the peak corresponding to Tyr'*^-Lys^^ was 
35 seen using single gene disruption of KEXl and KEX2 to express proCCK (data not 

shown). Thus the Tyr''^-Val®° must be a degradation product in accordance with CCK-22 
arising from cleavage after Lys^^. Additional fragments were discovered by CCK 
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expression in the kex2 kexl strain corresponding to processing leading to CCK-61 (not 
identified in mammals), CCK-58, C-terminal extended CCK-39 and C-terminal extended 
CCK-22 (Figure 12), whereas none of the peptides corresponding to CCK-8 could be 
Identified, In accordance with our previous worl< showing that Kex2p is responsible for 
this processing (Rourke et al., 1997). 

Kex2p is Involved In the biosynthesis of CCK-22 

Previous analysis of CCK peptides secreted from a kex2 strain as well as the results 
obtained by mass spectrometry Indicate that the cleavage at Lys" releasing CCK-22 
can occur without the involvement of Kex2p. However, the kex2 strain shows a 
decrease in CCK-22 concentration. ProCCK was expressed both In the vacuole protease 
deficient and the Isogenic kex2 strain (B32168 and UY23) and the processed intra- and 
extracellular fractions of CCK-22 from exponentially growing cells wer« measured. 
15 Approximately 28% of the intracellular CCK content was processed after Lys" in 

BJ2168, whereas only 6% was processed within the Arex2 strain. Analysis of secreted 
CCK peptides showed that the media collected from BJ2168 expressing proCCK 
contained approximately 20% CCK-22, whereas from the kex2 mutant, the amount was 
reduced to 5%. These results indicate that Kex2p Is Involved in the processing leading 
to CCK-22. However, there are other proteases that can perform the cleavage at Lys". 



20 



In vitro assay of Lys®^ cleavage 

To Investigate the nature of the protease(s) in addition to Kex2p that are able to 
25 perform the endoproteolytic cleavage after the single Lys" residue of proCCK, an in 
vitro assay was established using crude preparations frx>m of S. cerevisiae and 
synthetic CCK-33 as substrate. 

Using extract from the vacuole protease deficient strain, BJ2168, there was an 
30 extensive N-termlnal degradation, and the recovery of measurable CCK was less than 
10% of the control without yeast extract. Because the assay depends on the intact 
N-termlnus of CCK-22 for the antibody to bind, the inventors created a strain where 
some of the Icnown S. cerevisiae aminopeptldases were deleted. The Y14953 strain 
(apei) was used as parental strain In which the APE2 and APE3 genes were also 
35 deleted. Using this UY123 strain to prepare the cell extract there was a 2-3 fold better 
recovery of Immuno-reactivity compared to the recovery seen with BJ2168. 
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Processing to CCK-22 depends on metal ions 

Tine nature of the protease performing the cleavage of synthetic human CCK-33 to 
CCK-22 was analysed by inclusion of a number of different inhibitors with the extract 
5 from UY123. The results showed that only the addition of a metal chelating agent 
inhibited proteolysis of CCK-33 to CCK-22 (Fig. 4 A). 

The metal dependency of the protease was tested In vitro, after the activity Initially was 
inhibited by addition of 1 mM EDTA. Reconstitution of the activity leading to maturation 

10 of CCK-22 was tested by addition of different divalent cations in 0.2 mM surplus. 

Addition of Zn^*, Co^"" and Mn^* could reestablish the protease activity, whereas Ca^*, 
Cu^* or l^g^* had no effect (Fig. 4 B) in accordance with the properties of known 
metalloproteases, which are only activated by Zn^+, Co^* and Mn^*. Reactivation using 
increasing Zn^* concentrations showed a biphasic pattern, with Zn^* acting inhibitory at 

15 concentrations above 5 mi^ (data not shown). 

The time course of CCK-cleavage by Zn^* and l^n^* reactivated metalloproteases was 
analysed using cell extract from UY123, after initial inhibition with 1 ml^ EDTA. 
Reactivation was performed by addition of 1.2 mM Zn^* or Mn^^ followed by incubation 

20 for 30, 60 and 120 min. In this assay and the following in vitro protease assays the 
inventors used the N-terminal acetylated CCK-33-Gly (Ac-CCK-33-Gly) as substrate, 
which resulted in much slower non-specific degradation. Measurement of the CCK-22 
immuno-reactivity before and after tryptic cleavage using Ab 89009 showed no 
difference in the activation potency between Zn^^ and Mn^+ at 30 and 60 min, however 

25 after 120 min 10% more CCK-22 immuno-reactivity was measured using Mn^* as 

activator compared to Zn^* (Fig. 5). This increase in Immuno-reactivity is probably due 
to an inhibition of degradation following addition of Mn^* here as well as to the yeast 
ceil extracts used in Table 4. 

30 Table 4. Metalloproteases in Saccliaromyces cerevislae. Search performed in Swiss-Prot 
Sequence Retrieval System (SRS) httD://www.exp asv.ch/. Protease assay performed in two 
independent assays (A and B) using extracts from the metalloprotease deficient strains. 
The amount of CCK-22 is measured with Ab 89009 and the total amount of CCK is 
measured after tryptic cleavage with Ab 89009. Putative metalloproteases are marked 

35 with *. 



Name 


Swiss-Prot 


ORF 


CCK-22 


Total CCK [nM] 


Fraction 




acc# 




rnMl 


CCK-22 
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Ai 


Bi 


A-. 


B^ 


Ai/ A^ 




AAPl 


P37898 


YHR047C 


3.2 


3.2 


36 


32 


0.09 


0.10 


AFG3 


P39925 


YER017C 


2.8 


2.4 


23 


24 


0.12 


0 10 


APEl 


PI 4904 


YKL103C 


4.4 


3.7 


34 


28 


V. 1 J 


n 1*^ 


APE2 


P32454 


YKL157W 


APE3 


P37302 


YBEt286w 


DPP3 


008225 


YOL057W 


4.3 


4.6 


31 


35 


0 14 


0 1*^ 


LTA4 


010740 


YNL045W 


3.9 


4.0 


32 


29 


0.12 




MIPl 


P35999 


YKL134C 


3.4 


3.3 


34 


33 


0.10 


0.10 


PRDl 


P25375 


YCL057W 


2.4 


2.8 


28 


26 


0.09 


0 1 1 

v. 1 J 


QR17* 


P43122 


YDL104C 


2,8 


2.9 


23 


24 


0.12 


0.12 


RCAl 


P40341 


YMR089C 


4.2 


3-5 


31 


27 


0.14 




STE23 


006010 


YLR389C 


2.6 


2.3 


25 


20 


0,10 


0 12 


STE24 


P47154 


YJR117W 


3.4 


2.8 


19 


17 


0.18 


0 16 


YBS4* 


P38244 


YBR074W 


2.6 


2.7 


27 


29 


0.10 


0 09 


YHR3* 


P38821 


YHR113W 


2.5 


2.4 


26 


26 


0 10 


n no 


YHT2* 


P38836 


YHR132C 


3.8 


3.3 


34 


32 


0.11 


0.10 


YIK8* 


P40483 


YILlOSw 


5.7 


5.2 


43 


39 


0.13 


0.13 


YIN7* 


P40462 


YIL137C 


3.5 


2.8 


23 


20 


0.15 


0.14 


YK18* 


P36i32 


YKR038C 


2.9 


2.1 


23 


20 


0.13 


0.11 


YMEl 


P32795 


YPR024W 


2.3 


2.7 


25 


25 


0.09 


0.11 


MAS2 


P11914 


YHR024C 


ND, Lethal genes 


MASl 


PI 0507 


YLR163C 


AXLl 


P40851 


YPR122W 


3,5 


3.4 


31 


30 


0.11 


0.11 


CYMl* 


P32898 


YDR430C 


0.6 


0.4 


42 


39 


0.01 


0.01 


YOJ8* 


O 12496 


YOL098C 


3.6 


3.8 


35 


34 


0.10 


0.12 
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Extracellular yapsin activity 

To Investigate whether any protease activity Is secreted or attached extracellularly to 
the plasnna mennbrane, the protease activity was assayed in media and with intact 
5 yeast cells. No degradation of CCK-33 occurred after 1 h of incubation at 30*»C using 
media from exponential growing UY123 celts in accordance with earlier obsen/ations 
(Rourke et al., 1997). During incubation with Intact yeast ceils, cleavage to expose the 
N-terminus of CCK-22 could be measured (Fig. 6) however, this protease activity could 
not be abolished by the Inhibitors investigated (data not shown). By using intact cells 

10 containing gene disruptions of YPSl, YPS2 and YPS3 (Table I) the fraction of processed 
CCK-22 decreases by deletion of each of the three aspartyl proteases compared to wild 
type cells (Fig, 6). These data show that the three proteases all have extracellular 
protease activity, which can cleave at Lys^^ In proCCK. Preliminary results indicate that 
gene disruption of YPS7 decreases extracellular Lys^* processing in amounts 

15 comparable to the YPSl deletion (unpublished results). 

Expression of proCCK in metalloprotease deficient strains 

Based on previously described metal ioproteases in S, cerevisiae with endoproteolytic 
20 activity (Adames et a!., 1995; Schmidt et al., 2000), gene deletion strains of AXLl 
(UY201) and STE24 (UY202) were initially prepared In BJ2168. ProCCK expression in 
these strains showed that proteolysis after Lys^^ was unchanged compared to wild type, 
and It was decided to test the remaining metalloprotease deficient strains LJY123, 
UY203, UY204 and the metalloprotease deficient strains obtained through Euroscarf 
25 (Table I) for their ability to secrete CCK-22 Cmitochondrlal peptidases were not 

included). The CCK-22 Immuno-reactivlty did not change significantly among the CCK 
producing metalloprotease deficient strains (data not shown), and no protease 
responsible for the processing of heterologous expressed proCCK to CCK-22 was 
identified by this approach. 

30 

CYMl encodes a protease that can release the free N-termlnus of CCK-22 

Cell extracts were prepared from each of the viable metalloprotease deficient strains 
and tested in the in vitro protease assay to investigate whether any reduction in 
35 proteolysis was measurable. In this assay 1 mM Mn^"^ and 1 mM bestatin were included 
prior to the addition of Ac-CCK-33-Gly, since It was found that the recovery was 80- 
90% compared to 30% without addition of these aminopeptidase Inhibitors (data not 
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Shown). Deletion of CYMl almost abolished the protease activity, whereas none of the 
other metalloprotease deficient strains showed a significant change in the biosynthesis 
of CCK-22 (Table 4). 



5 Expression of CYMl on a multicopy plasmid Increases the fraction of matured CCK-22 In 
vitro 

To determine whether the amount of synthetlzed CCK-22 correlates with the amount of 
Cymlp In vitro, Cymlp was expressed on a multicopy plasmid and the fraction of 

10 synthetlzed CCK-22 analysed over time. Cell extract from BJ2168 transformed with 
PRS425 CYMl and the control transformed with the empty pRS425 vector were used In 
the in vitro protease assay with 1 mM Mn^* In which the reactions were terminated 
after 15, 30, 45 and 60 min. The CCK-22 Immuno-reactlvlty was measured with Ab 
89009 and the remaining CCK-33 was measured with the same antibody after tr/ptic 

15 cleavage (Fig. 7). Expression of CYMl on a multicopy plasmid enhanced the rate of 
CCK-22 production several fold. However, the inventors also observed an increased 
degradation of CCK-33 and CCK-22 (Fig. 7 B). When the same experiment was 
performed at pH 6.0 and pH 7.5, there was a dramatically increased degradation and 
after 30 min Incubation the CCK immuno-reactivity was undetectable at pH 6.0 (data 

20 not shown). These results show that the Lys-specific cleavage In CCK-22 maturation in 
vitro is dependent on the amount of Cymlp. 

Expression of proCCK in cyml mutant strain enhances CCK secretion 

25 To elucidate the role of CYMl In the biosynthesis of CCK-22 In vivo, gene deletions of 
CYMl were prepared in the vacuole protease deficient strain, BJ2168, and Isogenic 
kex2 strain. Deletion of CYMl resulted in an approximately 40% increase In the total 
amount of proCCK within the cells (Fig. 8 A) accompanied by a similar decrease in 
CCK-22 independent of KEX2 disruption (Fig. 8 C). Also the secreted amount of total 

30 CCK in the cyml strains increased with more than 60% (Fig. 8 B), but unlike the 
fractional decrease in intracellular CCK-22 there was an increase in the extracellular 
fractions of CCK-22 compared to vacuole protease deficient strain and the isogenic 
kex2 strain (Rg. 8 D). 
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Expression of CCK K->A mutant leads to Intracellular CCK accumulation comparable to 
the accumulation of wild type CCK in a cyml strain 

5 The observations that a gene disruption of CYMl causes an increase in intracellular 
concentrations of CCK (Fig. 8 A) raise the question whether the proteolytic activity of 
Cymlp leads to degradation of CCK-22 prior to translocation Into the ER. Therefore, the 
Inventors examined the Intracellular CCK content in strains expressing CCK where the 
maturation of CCK-22 has been eliminated by using the Lys®^ -> Ala^^ mutant. 

10 Transformants of this CCK mutant in the vacuole protease deficient strain, BJ2168 and 
the Isogenic cyml strain were analysed using Ab 7270 after trypsin and 
carboxypeptidase B treatment and there was an increase in the intracellular CCK 
immuno-reactivity for this construct compared to expression of wild type CCK (Fig. 9). 
l^utant CCK (K^A) and wild type CCK transformants resulted In an Increase In the 

15 intracellular proCCK concentration when expressed in BJ2168 and the CYMl disruption 
strain, respectively. The Increase in intracellular proCCK was not additive showing that 
proteolytic activity of Cymlp leads to degradation of CCK-22 prior to translocation into 
the ER. 

20 Expression of proCCK in aspartyl protease deficient strains 

The Lys"-specific cleavage of proCCK was analysed in null mutants of YPSl, -2 and -3, 
where the Intra- and extracellular amount of CCK-22 was measured from exponentially 
growing cells of wild type yeast, BY4705 and the isogenic aspartyl protease deficient 

25 strains transformed UY13, -14 and -15 with proCCK. Both Intra- and extracellular CCK 
immuno-reactivity of BY4705 was lowered more than 10 fold compared to the vacuolar 
protease deficient strain, BJ2168 (data not shown). The Intracellular fraction of CCK-22 
decreased significantly from approximately 28% In wild type cells to 17% in the ypsl 
strain, whereas no additional decrease could be measured by gene disruptions of YPS2 

30 and YPS3 (Fig. 10 A). The extracellular fraction of CCK-22 did however show that 
Ypslp, Yps2p and YpsSp all are involved in the biosynthesis of CCK-22 and that the 
triple mutant reduced the fraction of CCK-22 to 2/3 compared to wild type yeast (Fig. 
10 B). 



35 



CYMl disruption leads to a two-fold Increase In the total amount of secreted wild type 
CCK as well as the CCK K->R mutant 
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To elucidate whether Cymlp cleaves C-termlnal!y to a single Arg residue, the CCK 
(K->R) mutant was expressed In the vacuolar protease deficient strain, BJ2168 and the 
isogenic cyml strain. The concentration of both intra- and extracellular CCK was 
compared to wild type CCK expressed in these strains. The total amount of the mutated 
5 CCK (K-»R) was Increased both intra- and extracellular comparable to wild type CCK 
(Fig. 11). Both wild type CCK and the Lys^^-» Arg" mutant showed more than a two- 
fold increase in the measurable amount of extracellular CCK when expressed In the 
cyml strain (Fig, 11 B). 

10 Usage of Cymlp activity in the synthesis of peptides 

Another aspect of the invention is to use the activity from Cymlp, either expressed 
from its own promoter or from a strong constitutive promoter such as PGKl, ADHl or 
TPIl, or the induceable GALl promoter to produce an increased amount cytosolic 
15 Cymlp activity. As previously mentioned, the synthesis of CCK-22 Is significantly 
Increased when CYMl is transcribed from its own promoter on a 2^ plasmid (Fig. 7). 
Transcription can either be performed from a plasmid containing the promoter, CYMl 
and a terminator, or by introducing the desired promoter into the genome by 
heterologous recombination to substitute the endogenous promoter of CYMl. 

20 

The activity can be used intracellularly to generate peptides that do not require post- 
translatlonal modifications from the secretory pathway, such as disulfide bond 
formation, /V- and O-glycosylatlon or exoprotease activity. 

25 The role of Cymlp cytosolic activity in intracellular peptide synthesis is shown in the 
biosynthesis of CCK-22 in wild type cells compared to the isogenic strain with a CYMl 
disruption, which shows a significant increase in the amount of CCK-22 (Fig. 8C). 
Synthesis of the peptide of interest should be performed in such a way that 
translocation into the endoplasmatlc reticulum (ER) is avoided. This can be performed 

30 either by removal of the hydrophobic amino-terminal signal sequence from proteins 
that enter the ER post-translationaliy, or by expression in a temperature sensitive 
secretory mutant such as secSl, which abolishes translocation of secretory peptides 
Into the ER when the temperature Is elevated to 37**C, 

35 The propeptide or prepropeptide of interest will then be cytosolicly located and a 

potential substrate for Cymlp. Release of the peptide from its precursor will be carried 
out by the Cymlp activity by Introduction of the cleavage site seen from proCCK, which 
results In the release of Gly-extended CCK-22 after endoproteolytic processing C- 
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terminal to Lys" (Ser-Ile-Val-Lys^^ 4.) (Fig. 13A). If the peptide of Interest Is GLPl, 
synthesis can be performed as a fusion to a Cymlp cleavage site, which could be part 
of proCCK (Fig. 13B). The peptide of Interest will then accumulate In the cytosol and 
can be purified from sedlmented cells after lysis. 

5 

Expression of proBNP in cyml mutant strain enhance proBNP secretion 

To elucidate the role of CYMl In the biosynthesis of proBNP in vivo, proBNP was 
expressed in the the vacuole deficient strain, BJ2168 and the three protease deficient 

10 Isogenic strains, Acyml::LEU2 (UY430), Aypsl::TRPl (UY440) and a Acyml::LEU2 
Aypsl::TRPl (UY431). Deletion of cmi resulted In an approximately 100% increase in 
the total amount of secreted proBNP, whereas the proBNP secretion was Independent 
on disruption of the gene encoding the aspartyl protease, Ypslp and was thereby 
similar to the wildtype strain (Fig. 15A). Disruption of both Cyml and Ypsl was as 

15 expected similar to the secreted amount in a cyml mutant (Fig. 15A). 

Analysis of the proBNP In secreted from a cyml mutant 

To analyse the proBNP expressed In Saccharomyces cerevisiae, media from a cyml 
20 mutant was applied to FPLC chromatography and analysed by RIA using Ab. 98192. 
The peak elutlng from fraction 34-39 corresponds to intact proBNP, whereas the peak 
eluting in fraction 53-62 is a processed form of proBNP, most likely the proBNP 
fragment 1-76 (Fig. 15B). The release of fragment 1-76 and BNP-32 from proBNP, is 
due to cleavage after a single Arg residue and Is probably due to either Kex2 or Ypsl 
25 activity. 

Discussion 

The secreted polypeptides varies with the growth conditions, the fraction of CCK-22 
increasing when the culture reaches stationary phase, while the Intracellularly 
processed fraction remains unaltered under stress conditions. The Increase In 

30 extracellular cleavage to CCK-22 as the cells enter stationary phase could indicate that 
extracellular endoproteases with the ability to process proCCK to CCK-22 are secreted 
or expressed on the cell membrane. It is known that the aspartyl proteases, Ypslp and 
Yps2p, exhibit cell surface activity (Komano et aL, 1998). In addition, it has previously 
been shown that heterologous peptide expression In a ypsl strain improved the 

35 recovery of proteins and peptides like albumin, glucagon, GLPl, GLP2 and CART by 
Inhibiting proteolysis C-terminal to mono-basic residues (Egel-MltanI et aL, 2000; 
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Kerry-Williams et al., 1998). Thus, recent studies ((Egel-|v|itani et al., 2000; Kerry- 
Williams et al., 1998) and those of the present inventor) show the importance of 
collecting secreted peptides during exponential growth in order to avoid additional 
extracellular processfng. 

5 

ProCCK expressed in a vacuole protease deficient strain showed 30% intracellular 
processing at Lys" In proCCK. The fraction of extracellular Lys"-processing Is, 
however, decreased to 2/3 of the observed fraction within Intact yeast cells, which 
reveals an intracellular degradation of CCK-22 prior to secretion. The Increase in 
extracellular proteolysis under limited nutrient resources Is probably due to an 
activation or upregulation in transcription of the extracellular proteases under limited 
nutrient resources as seen with the upregulation of YPSl transcription during stationary 
phase (Gasch et al., 2000). Part of the cell surface activity can be assigned to the 
yapsins, Ypslp, Yps2p and Yps3p, but some extracellular activity was sustained even In 
15 the triple mutant. 



10 



20 



25 



30 



35 



In the present study, the Inventors have shown that deletion of KEX2 causes a 5 fold 
reduction in both the Intracellular and extracellular Lys"-cleavage. The kex2 strain 
expressing proCCK do not only alter the cleavage of Lys" in proCCK, It also changes 
the intracellular retention time of CCK as the intracellular concentration of CCK peptides 
is reduced with more than 60%, while the extracellular CCK concentration is Increased 
by almost 60% compared to wild type yeast, i^1oreover, analysis of the secreted CCK 
peptides from the kexl kex2 double mutant and the kex2 mutant showed 
disappearance of the Tyr«-Val*° degradation product. Thus, the removal of Lys" by 
Kexlp was abolished in a kex2 strain Indicating an enhanced secretion rate through the 
trans-Golgl network. These results and the observations on the rapid secretion of 
proCCK suggest that It may be the intracellular retention caused by Kex2p that leads to 
an Increased synthesis of CCK-22 In wild type yeast by Ypslp and probably to some 
extent by Kex2p. 

The type of protease responsible for the intracellular maturation of CCK-22 was 
investigated in an In vitro protease assay using a crude extract of S. cerevlslae to 
analyse the processing of synthetic human CCK-33 to CCK-22 in the presence of 
different inhibitors. By not including detergents in extraction of protease activity, 
activity from Kex2p as well as the GPI-anchored yapsins was avoided (Azar/an et al., 
1993; Fuller et al., 1989; Komano et al., 1999). Of the Inhibitors tested, the proteolysis 
was only inhibited by EDTA and 1,10 ortho-phenanthrollne, and the activity could be 
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restored by addition of the divalent cations Zn^"*", Co^"*" and Mn^**". This indicated that a 
metalloprotease participates in the nnaturation of CCK-22. 

None of the candidate metalloproteases contain an obvious signal peptide to direct the 
5 protein into ER. Therefore, the inventors investigated strains deficient in each of the 
metalloproteases with the exception of nnltochondrial proteases. Expression of proCCK 
In each of the strains resulted in unaltered maturation of CCK-22 similar to that seen in 
wild type yeast. However, by using the in vitro protease assay the Inventors identified 
Cymlp as an endoprotease performing post-Lys cleavage of CCK-33. That Cymlp can 
10 cleave Lys^^ In proCCK was verified by overexpresslon studies, showing a several fold 
Increase In enzyme activity. 



Intracellular synthesis of CCK-22 was decreased in a cyml strain accompanied by an 
increased concentration of total proCCK. In contrast, the fraction of extracellular 

15 CCK-22 was increased compared to wild type yeast with a parallel increase In total 
CCK. These findings are in accordance with a cytosolic location of the Cymlp activity 
lil<e most Insulin-degrading enzymes (Bal et ah, 1996) and show that It acts on the 
preproMfalp-proCCK construct prior to translocation Into the endoplasmatic reticulum 
Thus, the pre-translocational degradation of proCCK Is decreased by CYMl disruption 

20 and the total production Increased. 



Expression of proBNP as a fuslonpeptide to the preproMfalp sequence In a cymlA 
mutant shows a two-fold increase of the extracellular proBNP content compared to the 
wild type strain. Analysis of the secreted proBNP by chromatography disclosed that two 
25 major forms were present. One is the entire proBNP, whereas the other is the proBNP 
fragment 1-76, thus the biologically active BNP-32 is also syntheslsed, though not 
detectable In the present assay. The release of proBNP fragment 1-76, most likely 
depends on the Kex2p activity, however this could not be tested In the present assay, 
since the release of proBNP depends on both Kex2p and Kexlp. 

30 

Ail publications discussed above are incorporated herein in their entirety. 

It will be appreciated by persons skilled in the art that numerous variations and/or 
modifications may be made to the invention as shown in the specific embodiments 
35 without departing from the spirit or scope of the invention as broadly described. The 
present embodiments are, therefore, to be considered in all respects as illustrative and 
not restrictive. 
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Claims 

1. A method for producing a protein of interest in a host cell, wherein said host cell has 
been genetically modified in order to express significantly reduced levels of a 
metalloprotease comprising a HXXEH motif (SEQ ID NO 1), compared to the corresponding 
5 non-modified cell when cultured under identical conditions, the method comprising 

a) introducing Into the host cell a nucleic acid sequence encoding the protein of 
interest, 

10 b) cultivating the host cell of step (a) In a suitable growth medium for production of 

the protein of Interest, and 

c) isolating the protein of Interest. 

15 2. A method according to claim 1, wherein the metalloprotease further comprises a 

glutamic acid residue between 70 and 80 amino acids C-terminal of the second His residue 
in the HXXEH motif. 

3. A method according to any of the preceeding claims, wherein the metalloprotease 

20 further comprises a glysine residue 3 amino acids N-termlnal of the first His residue In the 
HXXEH motif. 

4. A method according to any of the preceeding claims, wherein the metalloprotease 
further comprises a glysine residue 5 amino acids C-terminal of the second His residue in 

25 the HXXEH motif. 

5. A method according to any of the preceeding claims, wherein the metalloprotease 
further comprises a lysine residue 8 amino acids C-termlnal of the second His residue In 
the HXXEH motif. 

30 

6. A method according to any of the preceeding claims, wherein the metalloprotease 
further comprises a tyrosine residue 9 amino acids C-termlnal of the second His residue in 
the HXXEH motif. 

35 7. A method according to any of the preceeding claims, wherein the metalloprotease 

further comprises a proline residue 10 amino acids C-termlnal of the second His residue In 
the HXXEH motif. 
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8. A method according to any of the preceeding claims, wherein the metalloprotease 
further comprises the consensus sequence SEQ ID NO 2. 

9. A method according to any of the preceeding claims wherein the metalloprotease further 
5 comprises the consensus sequence SEQ ID NO 3. 

10. A method according to any of the preceeding claims, wherein the metalloprotease 
further comprises a NAXTXXXXT motif between 20 and 30 amino acids C-termlnal of the 
second His residue In the HXXEH motif. 

10 

11. A method according to any of the preceeding claims, wherein the metalloprotease is 
selected from: 

i) any one of the group consisting of SEQ ID NO*s 4 to 15, and 

15 

11) a sequence which is at least 80% identical to any one of SEQ ID NO*s 4 to 15. 

12. A method according to any of the preceeding claims, wherein the metalloprotease is at 
least 80% identical to the SEQ ID NO: 4. 

20 

13. A method according to any of the preceeding claims, wherein the total amount of the 
protein of Interest is increased at least 5% compared the corresponding non-modified cell 
when cultured under identical conditions. 

25 14. A method according to any of the preceeding claims, wherein the total amount of the 
protein of interest is increased at least 50% more than the corresponding non-modified cell 
when cultured under identical conditions. 

15. The method according to any of the preceeding claims, in which the host cell is a 
30 prokaryotic cell. 

16- The method according to any of claims 1-14, In which the host cell is a eukaryotic cell. 

17. The method according to claim 16, in which the host cell is a non-filamentous fungal 
35 cell. 

18. The method according to claim 16, in which the host cell is a filamentous fungal cell. 



19. The method according to claim 17, in which the host cell is a strain of Saccharomycces. 
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20. The method according to claim 19, in which the host cell is Saccharomyces cerevisiae. 

21. A host cell useful for the expression of a protein of interest, wherein said cell has been 
5 genetically modified in order to express significantly reduced levels of a metalloprotease 

comprising a HXXEH motif (SEQ ID NO 1) than the corresponding non-modified cell when 
cultured under identical conditions. 

22. A host cell according to claim 21, wherein the metalloprotease further comprises the 
10 consensus sequence SEQ ID NO 3. 
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ATGAGATTTCCTTCAATTTTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCT 
1 + + ^ ^ ^ ^ 

TACTCTAAAGGAAGTTAAAAATGACGTCAAAATAAGCGTCGTAGGAGGCGTAATCGACGA 
MRFPSIFTAVLFAASSALAA - 

CCAGTCAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGAAGCTGTCATCGGT 
61 + + ^ ^ ^ ^ 

GGTCAGTTGTGATGTTGTCTTCTACTTTGCCGTGTTTAAGGCCGACTTCGACAGTAGCCA 
PVNTTTEDETAQIPAEAVIG 

TACTTAGATTTAGAAGGGGATTTCGATGTTGCTGTTTTGCCATTTTCCAACAGCACAAAT 
121 + + ^ ^ ^ 

ATGAATCTAAATCTTCCCCTAAAGCTACAACGACAAAACGGTAAAAGGTTGTCGTGTTTA 
YLDLEGDFDVAVLPFSNSTN 

AACGGGTTATTGTTTATAAATACTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTA 
181 + + + ^ ^ ^ 

TTGCCCAATAACAAATATTTATGATGATAACGGTCGTAACGACGATTTCTTCTTCCCCAT 
NGLLFINTTIASIAAKEEGV 

TCTTTGGATAAAAGAGAGGCTGAAGCTCACCCGCTGGGCAGCCCCGGTTCAGCCTCGGAC 
241 + — + + + ^ ^ 

AGAAACCTATTTTeTCTCCGACTTCGAGTGGGCGACCCGTCGGGGCCAAGTCGGAGCCTG 
SLDKREAEAHPLGSPGSASD 

TTGGAAACGTCCGGGTTACAGGAGCAGCGCAACCATTTGCAGGGCAAACTGTCGGAGCTG 
301 + + + ^ ^ ^ 

AACCTTTGCAGGCCCAATGTCCTCGTCGCGTTGGTAAACGTCCCGTTTGACAGCCTCGAC 
LETSGLQEQRNHLQGKLSEL 

CAGGTGGAGCAGACATCCCTGGAGCCCCTCCAGGAGAGCCCCCGTCCCACAGGTGTCTGG 
361 + + + ^ ^ ^20 

GTCCACCTCGTCTGTAGGGACCTCGGGGAGGTCCTCTCGGGGGCAGGGTGTCCACAGACC 
QVEQTSLEPLQESPRPTGVW 

AAGTCCCGGGAGGTAGCCACCGAGGGCATCCGTGGGCACCGCAAAATGGTCCTCTACACC 
421 + + + ^ ^ ^ 

TTCAGGGCCCTCCATCGGTGGCTCCCGTAGGCACCCGTGGCGTTTTACCAGGAGATGTGG 
KSREVATEGIRGHRKMVLYT 

CTGCGGGCACCACGAAGCCCCAAGATGGTGCAAGGGTCTGGCTGCTTTGGGAGGAAGATG 
481 + + + + ^ ^ 

GACGCCCGTGGTGCTTCGGGGTTCTACCACGTTCCCAGACCGACGAAACCCTCCTTCTAC 
LRAPRSPKMVQGSGCFGRKM 

GACCGGATCAGCTCCTCCAGTGGCCTGGGCTGCAAAGTGCTGAGGCGGCATTAA 
541 + + + + ^ 

CTGGCCTAGTCGAGGAGGTCACCGGACCCGACGTTTCACGACTCCGCCGTAATT 
DRISSSSGLGCKVLRRH* 



Fig. 14A 
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ATGAGATTTCCTTCAATTTTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCT 
1 + + + + + + 60 

TACTCTAAAGGAAGTTAAAAATGACGTCAAAATAAGCGTCGTAGGAGGCGTAATCGACGA 

MRFPSIFTAVLFAASSALAA 

CCAGTCAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGAAGCTGTCATCGGT 
61 + + + + + + 220 

GGTCAGTTGTGATGTTGTCTTCTACTTTGCCGTGTTTAAGGCCGACTTCGACAGTAGCCA 

PVNTTTEDETAQIPAEAVIG 

TACTTAGATTTAGAAGGGGATTTCGATGTTGCTGTTTTGCCATTTTCCAACAGCACAAAT 
121 + + + + + + 

ATGAATCTAAATCTTCCCCTAAAGCTACAACGACAAAACGGTAAAAGGTTGTCGTGTTTA 

YLDLEGDFDVAVLPFSNSTN 

AACGGGTTATTGTTTATAAATACTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTA 
181 + + + + + ^ 240 

TTGCCCAATAACAAATATTTATGATGATAACGGTCGTAACGACGATTTCTTCTTCCCCAT 

NGLLFINTTIASIAAKEEGV 

TCTTTGGATAAAAGAGAGGCTGAAGCTAGCCCCAAGATGGTGCAAGGGTCTGGCTGCTTT 
241 + + + + + ^ 3QQ 

AGAAACCTATTTTCTCTCCGACTTCGATCGGGGTTCTACCACGTTCCCAGACCGACGAi^ 

SLDKREAEASPKMVQGSGCF 

GGGAGGAAGATGGACCGGATCAGCTCCTCCAGTGGCCTGGGCTGCAAAGTGCTGAGGCGG 
301 + + + _+ + ^ 

CCCTCCTTCTACCTGGCCTAGTCGAGGAGGTCACCGGACCCGACGTTTCACGACTCCGCC 
GRKMDRISSSSGLGCKVLRR 

CATTAA 

361 366 

GTAATT 

* " Fig. 14B 
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ATGAGATTTCCTTCAATTTTTACTGCAGTTTTATTCGCAGCATCCTCCGCATTAGCTGCT 
1 + ^. ^ ^ ^ ^ 

TACTCTAAAGGAAGTTAAAAATGACGTCAAAATAAGCGTCGTAGGAGGCGTAATCGACGA 

MRFPSIFTAVLFAASSALAA 

CCAGTCAACACTACAACAGAAGATGAAACGGCACAAATTCCGGCTGAAGCTGTCATCGGT 
61 + + + ^ ^ ^ 

GGTCAGTTGTGATGTTGTCTTCTACTTTGCCGTGTTTAAGGCCGACTTCGACAGTAGCCA 

PVNTTTEDETAQIPAEAVIG 

TACTTAGATTTAGAAGGGGATTTCGATGTTGCTGTTTTGCCATTTTCCAACAGCACAAAT 
121 + + + ^ ^ ^ 

ATGAATCTAAATCTTCCCCTAAAGCTACAACGACAAAACGGTAAAAGGTTGTCGTGTTTA 

YLDLEGDFDVAVLPFSNSTN 

AACGGGTTATTGTTTATAAATACTACTATTGCCAGCATTGCTGCTAAAGAAGAAGGGGTA 
181 + + + ^ ^ ^ 

TTGCCCAATAACAAATATTTATGATGATAACGGTCGTAACGACGATTTCTTCTTCCCCAT 

NGLLFINTTIASIAAKEEGV 

TCTTTGGATAAAAGAAGCCCCAAGATGGTGCAAGGGTCTGGCTGCTTTGGGAGGAAGATG 
241 + + + ^ ^ ^ 

AG7W\ACCTATTTTCTTCGGGGTTCTACCACGTTCCCAGACCGACGAAACCCTCCTTCTAC 



SLDKRSPKMVQGSGC 



F G R K M 



GACCGGATCAGCTCCTCCAGTGGCCTGGGCTGCAAAGTGCTGAGGCGGCATTAA 
301 + + + ^ _^ 

CTGGCCTAGTCGAGGAGGTCACCGGACCCGACGTTTCACGACTCCGCCGTAATT 



DRISSSSGLGCKVLRRH 



* 



Fig. 14C 
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SEQUENCE LISTING 



10 



<110> Cymlp A/S 

J0nson, Lars 
Rehfeld, Jens F. 
Johnsen, Anders H. 

<120> Methods for increasing the production of 
a recombinant polypeptide from a host cell 



<130> P32077PC01 

<150> DK/PA 2002 01391 
15 <151> 2002-09-20 

<160> 68 

<170> FastSEQ for Windows Version 4.0 

20 





<210> 


1 




<211> 


5 




<212> 


PRT 


25 


<213> 


Artificial Sequence 




<220> 






<223> 


Pitrilysin consensus 


30 


<220> 






<221> 


MISC FEATURE 




<222> 


(2) (2) 




<223> 


X = any amino acid 


35 


<220> 






<221> 


MISC FEATURE 




<222> 


(3) . . (3) 




<223> 


X = any amino acid 



40 



45 



<400> 1 



His Xaa Xaa Glu His 





<210> 


2 




<211> 


46 




<212> 


PRT 


50 


<213> 


Artificial Sequence 




<220> 






<223> 


Pitrilysin consensus 


55 


<220> 






<221> 


MISC FEATURE 




<222> 


(2).. (2) 
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<223> X = any amino acid 



<220> 

5 <221> MISC_FEATURE 

<222> (3). -(3) 

<223> X = any amino acid 

<220> 

<2 2 1 > MIS COFEATURE 

10 <222> (5) . . (5) 

<223> X = any amino acid 



<220> 

15 <221> MISC_FEATURE 

<222> (6).. (6) 

<223> X = any amino acid 



20 <220> 

<221> MISC_FEATURE 

<222> (9).. (9) 

<223> X ~ any amino acid 

25 

<220> 

<221> MISC_FEATURE 

<222> (10) . . (10) 

<223> X - any amino acid 

30 



<220> 

<221> MISC_FEATURE 

<222> (11).. (11) 

35 <223> X = any amino acid 



<220> 

<221> MISC__FEATURE 

40 <222> (12).. (12) 

<223> X = any amino acid 



<220> 

45 <221> MISC_FEATURE 

<222> (14).. (14) 

<223> X = any amino acid 



50 



55 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(15) . . (15) 
X = any amino acid 



MISC_FEATURE 
(16) . . (16) 
X = any amino acid 
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<220> 

<221> MISC_FEATURE 

5 <222> (17)-. (17) 

<223> X = any amino acid 

<220> 

<221> MISC_FEATURE 

<222> (18).. (18) 

10 <223> X « any amino acid 



<220> 

<221> MISC_FEATURE 

15 <222> (19).. (19) 

<223> X - any amino acid 



<220> 

20 <221> MISC^FEATURE 

<222> (20) . , (20) 

<223> X - any amino acid 



25 <220> 

<221> MISC_FEATURE 

<222> (21).. (21) 

<223> X = any amino acid 

30 

<220> 

<221> MIS COFEATURE 

<222> (22).. (22) 

<223> X = any amino acid 



<220> 

<221> MISC^FEATURE 

<222> (23).. (23) 

40 <223> X = any amino acid 



<220> 

<221> MISC_FEATURE 

45 <222> (24) . . (24) 

<223> X « any amino acid 



<220> 

50 <221> MISC_FEATURE 

<222> (25).. (25) 

<223> X = any amino acid 



55 <220> 

<221> MISC_FEATURE 

<222> (26).. (26) 

<223> X = any amino acid 
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<220> 

<221> MISC_FEATURE 

<222> (27) . . (27) 

5 <223> X = any amino acid 
<220> 

<221> MISC_FEATURE 

<222> (28).. (28) 

<223> X = any amino acid 



<220> 

<221> MISC_FEATURE 

<222> (29) . . (29) 

15 <223> X — any amino acid 



<220> 

<221> MISC_FEATURE 

20 <222> (30) . . (30) 

<223> X = any amino acid 



<220> 

25 <221> MIS COFEATURE 

<222> (31) . . (31) 

<223> X = any amino acid 



30 



35 



40 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(32) (32) 
X = any amino acid 



MISC_FEATURE 
(33) . . (33) 
X 5= any amino acid 



<220> 
<221> 
<222> 
45 <223> 



MISC_FEATURE 
(34) . . (34) 
X = any amino acid 



<220> 

<221> MISC_FEATURE 

50 <222> (35) . - (35) 

<223> X = any amino acid 



<220> 

55 <221> MISC_FEATURE 

<222> (36).,{36) 

<223> X = any amino acid or is absent 
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<220> 

<221> MISC_FEATURE 

<222> (37) . . (37) 

<223> X = any amino acid or is absent 
5 <220> 

<221> MISC_FEATURE 

<222> (40).. (40) 

<223> X = any amino acid 

10 

<220> 

<221> MISC_FEATURE 

<222> (42).. (42) 

<223> X = any amino acid 

15 

<220> 

<221> MISC_FEATURB 

<222> (43).. (43) 

20 <223> X = any amino acid 

<220> 

<221> MISC_FEATURE 

25 <222> (44) . . (44) 

<223> X = any amino acid 

<220> 

30 <221> MISC_FEATURE 

<222> (45).. (45) 

<223> X = any amino acid 

35 <400> 2 

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 
15 10 15 

40 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
20 25 30 

45 Xaa Xaa Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr 
35 40 45 

<210> 3 
50 <211> 4 6 
<212> PRT 

<213> Artificial Sequence 
55 <220> 

<223> Pitrilysin consensus sequence 
<220> 

<221> MISC FEATURE 
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<222> (2).. (2) 

<223> X = any amino acid 

<220> 

5 <221> MISC_FEATURE 
<222> (3).. (3) 
<223> X = any amino acid 



10 



15 



20 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(5) (5) 

X = any amino acid 



MISC_FEATURE 
(6) . . (6) 

X — any amino acid 



<220> 
<221> 
<222> 
25 <223> 



MISC_FEATURE 
(9) . . (9) 

X = any amino acid 



<220> 

<221> MISC_FEATURE 

30 <222> (10) . . (10) 

<223> X = any amino acid 



<220> 

35 <221> MISC_FEATURE 

<222> (11).. (11) 

<223> X = any amino acid 



40 



45 



50 



<220> 
<221> 

<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(12) . . (12) 
X = any amino acid 



MISC_FEATURE 
(14) . . (14) 
X = S or T 



<220> 
<221> 
<222> 
55 <223> 



MISC_FEATURE 
(15) . . (15) 
X = any amino acid 



<220> 
<221> 



MISC FEATURE 
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<222> (19) . . (19) 

<223> X - any amino acid 

<220> 
<221> MISC_FEATURE 
5 <222> (20) . . (20) 

<223> X = any amino acid 



<220> 

10 <221> MISC_FEATURE 

<222> (21).. (21) 

<223> X = any amino acid 



15 



20 



25 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(22).. (22) 
X = any amino acid 



MISC_FEATURE 
(23) . . (23) 
X = any amino acid 



<220> 
<221> 
<222> 
30 <223> 



MISC_FEATURE 
(24) . . (24) 
X = any amino acid 



<220> 
<221> 
35 <222> 
<223> 



MISC_FEATURE 
(25) . . (25) 
X = any amino acid 



<220> 
40 <221> 
<222> 
<223> 



MISC_FEATURE 
(26) . . (26) 
X = any amino acid 



45 <220> 

<221> MISC_FEATURE 

<222> (27)_(27) 

<223> X = any amino acid or is absent 

50 

<220> 

<221> MIS C_FE AT ORE 

<222> (28) . . (28) 

<223> X = any amino acid or is absent 

55 



<220> 
<221> 
<222> 



MISC_FEATURE 
(30) . . (30) 
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<223> X = any amino acid 

<220> 
<221> 
<222> 
5 <223> 



MISC_FEATURE 
(31) . . (31) 
X = any amino acid 



<220> 

<221> MISC_FEATURE 

10 <222> (32).. (32) 

<223> X = any amino acid 



<220> 

15 <221> MISC_FEATURE 

<222> (33) . . (33) 

<223> X = any amino acid 



20 



25 



30 



<220> 
<221> 
<222> 
<223> 



<220> 
<221> 
<222> 
<223> 



MISC_FEATURE 
(34) (34) 
X = any amino acid 



MISC_FEATURE 
(35) . - (35) 
X = any amino acid 



<220> 
<221> 
<222> 
35 <223> 



MISC_FEATURE 
(36) . . (36) 
X = any amino acid 



<220> 

<221> MISC_FEATURE 

40 <222> (37).. (37) 

<223> X = any amino acid or is absent 



<220> 

45 <221> MISC_FEATURE 

<222> (40) . . (40) 

<223> X = any amino acid 



50 <220> 

<221> MIS COFEATURE 

<222> (42) . . (42) 

<223> X = any amino acid 

55 

<220> 

<221> MISC__FEATURE 

<222> (43) . . (43) 

<223> X = any amino acid 
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<220> 
<221> MISC_FEATURE 
<222> (44).. (44) 
<223> X = D or E 

5 

<220> 

<221> MISC^FEATDRE 
<222> (45).. (45) 
10 <223> X = any amino acid 

<400> 3 

15 Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Lys 
15 10 15 

Tyr Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa 
20 20 25 30 

Xaa Xaa Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr 
35 40 45 

25 

<210> 4 
<211> 989 
<212> PRT 
30 <213> Saccharomyces cerevisiae 

<400> 4 

Met Leu Arg Phe Gin Arg Phe Ala Ser Ser Tyr Ala Gin Ala Gin Ala 
35 1 5 10 15 

Val Arg Lys Tyr Pro Val Gly Gly lie Phe His Gly Tyr Glu Val Arg 
20 25 30 

40 

Arg He Leu Pro Val Pro Glu Leu Arg Leu Thr Ala Val Asp Leu Val 
35 40 45 

45 

His Ser Gin Thr Gly Ala Glu His Leu His He Asp Arg Asp Asp Lvs 
50 55 60 

50 Asn Asn Val Phe Ser He Ala Phe Lys Thr Asn Pro Pro Asp Ser Thr 
65 70 75 80 

Gly Val Pro His He Leu Glu His Thr Thr Leu Cys Gly Ser Val Lys 
55 85 90 95 



Tyr Pro Val Arg Asp Pro Phe Phe Lys Met Leu Asn Lys Ser Leu Ala 
100 105 110 
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Asn Phe Met Asn Ala Met Thr Gly Pro Asp Tyr Thr Phe Phe Pro Phe 
115 120 125 

5 

Ser Thr Thr Asn Pro Gin Asp Phe Ala Asn Leu Arg Gly Val Tyr Leu 
130 135 140 

10 

Asp Ser Thr Leu Asn Pro Leu Leu Lys Gin Glu Asp Phe Asp Gin Glu 
145 150 155 160 

15 Gly Trp Arg Leu Glu His Lys Asn lie Thr Asp Pro Glu Ser Asn He 

165 170 175, 



Val Phe Lys Gly Val Val Tyr Asn Glu Met Lys Gly Gin He Ser Asn 
20 180 185 190 



Ala Asn Tyr Tyr Phe Trp Ser Lys Phe Gin Gin Ser He Tyr Pro Ser 
195 200 205 

25 

Leu Asn Asn Ser Gly Gly Asp Pro Met Lys He Thr Asp Leu Arg Tyr 
210 215 220 

30 

Gly Asp Leu Leu Asp Phe His His Lys Asn Tyr His Pro Ser Asn Ala 
225 230 235 240 

35 Lys Thr Phe Thr Tyr Gly Asn Leu Pro Leu Val Asp Thr Leu Lys Gin 

245 250 255 



Leu Asn Glu Gin Phe Ser Gly Tyr Gly Lys Arg Ala Arg Lys Asp Lys 
40 260 265 270 



Leu Leu Met Pro He Asp Leu Lys Lys Asp He Asp Val Lys Leu Leu 
275 280 285 

45 

Gly Gin He Asp Thr Met Leu Pro Pro Glu Lys Gin Thr Lys Ala Ser 
290 295 300 

50 

Met Thr Trp He Cys Gly Ala Pro Gin Asp Thr Tyr Asp Thr Phe Leu 
305 310 315 320 

55 Leu Lys Val Leu Gly Asn Leu Leu Met Asp Gly His Ser Ser Val Met 

325 330 335 

Tyr Gin Lys Leu He Glu Ser Gly He Gly Leu Glu Phe Ser Val Asn 
340 345 350 
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Ser Gly Val Glu Pro Thr Thr Ala Val Asn Leu Leu Thr Val Gly lie 
355 360 365 

5 

Gin Gly Val Ser Asp He Glu He Phe Lys Asp Thr Val Asn Asn He 
370 375 380 

10 Phe Gin Asn Leu Leu Glu Thr Glu His Pro Phe Asp Arg Lys Arg He 
385 390 395 400 

Asp Ala He He Glu Gin Leu Glu Leu Ser Lys Lys Asp Gin Lys Ala 
15 405 410 415 

Asp Phe Gly Leu Gin Leu Leu Tyr Ser He Leu Pro Gly Trp Thr Asn 
420 425 430 

20 

Lys He Asp Pro Phe Glu Ser Leu Leu Phe Glu Asp Val Leu Gin Arg 
435 440 445 

25 

Phe Arg Gly Asp Leu Glu Thr Lys Gly Asp Thr Leu Phe Gin Asp Leu 
450 455 460 

30 He Arg Lys Tyr He Val His Lys Pro Cys Phe Thr Phe Ser He Gin 
465 470 475 480 

Gly Ser Glu Glu Phe Ser Lys Ser Leu Asp Asp Glu Glu Gin Thr Arg 
35 485 490 495 

Leu Arg Glu Lys He Thr Ala Leu Asp Glu Gin Asp Lys Lys Asn He 
500 505 510 

40 

Phe Lys Arg Gly He Leu Leu Gin Glu Lys Gin Asn Glu Lys Glu Asp 
515 520 525 



45 



Leu Ser Cys Leu Pro Thr Leu Gin He Lys Asp He Pro Arg Ala Gly 
530 535 540 



50 Asp Lys Tyr Ser He Glu Gin Lys Asn Asn Thr Met Ser Arg He Thr 
545 550 555 560 

Asp Thr Asn Gly He Thr Tyr Val Arg Gly Lys Arg Leu Leu Asn Asp 
55 565 570 575 

He He Pro Phe Glu Leu Phe Pro Tyr Leu Pro Leu Phe Ala Glu Ser 
580 585 590 
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Leu Thr Asn Leu Gly Thr Thr Thr Glu Ser Phe Ser Glu He Glu Asp 
595 600 605 

5 Gin He Lys Leu His Thr Gly Gly He Ser Thr His Val Glu Val Thr 
610 615 620 

Ser Asp Pro Asn Thr Thr Glu Pro Arg Leu He Phe Gly Phe Asp Glv 
10 625 630 635 640 

Trp Ser Leu Asn Ser Lys Thr Asp His He Phe Glu Phe Trp Ser Lys 
^5 650 655 

He Leu Leu Glu Thr Asp Phe His Lys Asn Ser Asp Lys Leu Lys Val 
660 665 670 

20 

Leu He Arg Leu Leu Ala Ser Ser Asn Thr Ser Ser Val Ala Asp Ala 
675 680 685 

25 Gly His Ala Phe Ala Arg Gly Tyr Ser Ala Ala His Tyr Arg Ser Ser 
690 695 700 

r,r. ^■^^ lie Glu Gin Leu Gin Phe He 

30 705 710 715 720 

Asn Arg Leu His Ser Leu Leu Asp Asn Glu Glu Thr Phe Gin Arg Glu 
35 725 730 735 

Val Val Asp Lys Leu Thr Glu Leu Gin Lys Tyr He Val Asp Thr Asn 
740 745 750 



40 



Asn Met Asn Phe Phe He Thr Ser Asp Ser Asp Val Gin Ala Lys Thr 
755 760 765 



45 Val Glu Ser Gin He Ser Lys Phe Met Glu Arg Leu Pro His Gly Ser 
770 775 780 

^^"^ "^^^ Ser Asp Tyr Pro Leu He Gly Ser 

^0 785 790 795 800 

Lys Cys Lys His Thr Leu He Lys Phe Pro Phe Gin Val His Tyr Thr 
805 810 815 

Ser Gin Ala Leu Leu Gly Val Pro Tyr Thr His Lys Asp Gly Ser Ala 
820 825 830 



Leu Gin Val Met Ser Asn Met Leu Thr Phe Lys His Leu His Arg Glu 
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835 840 845 

Val Arg Glu Lys Gly Gly Ala Tyr Gly Gly Gly Ala Ser Tyr Ser Ala 
5 850 855 860 

Leu Ala Gly lie Phe Ser Phe Tyr Ser Tyr Arg Asp Pro Gin Pro Leu 
865 870 875 880 

Lys Ser Leu Glu Thr Phe Lys Asn Ser Gly Arg Tyr lie Leu Asn Asp 
885 890 895 

15 

Ala Lys Trp Gly Val Thr Asp Leu Asp Glu Ala Lys Leu Thr lie Phe 
900 905 910 

20 Gin Gin Val Asp Ala Pro Lys Ser Pro Lys Gly Glu Gly Val Thr Tyr 
915 920 925 

Phe Met Ser Gly Val Thr Asp Asp Met Lys Gin Ala Arg Arg Glu Gin 
25 930 935 940 

Leu Leu Asp Val Ser Leu Leu Asp Val His Arg Val Ala Glu Lys Tyr 
3Q 950 955 960 

Leu Leu Asn Lys Glu Gly Val Ser Thr Val He Gly Pro Gly He Glu 
965 970 975 

35 

Gly Lys Thr Val Ser Pro Asn Trp Glu Val Lys Glu Leu 
980 985 

40 <210> 5 
<211> 882 
<212> PRT 

<213> Schizosaccharomyces pombe 
45 <400> 5 

Met Asn Tyr Ala Lys Leu Ser He Ala Phe Ser Lys Lys Thr He Lys 
1 5 10 15 

50 

Thr His Asn Cys Arg Leu Phe Gin Arg Trp Leu His Val Gly Asp Lys 
20 25 30 

Val His Asp Phe Arg Val Val Asp Thr Lys Lys Val Pro Glu Leu Gin 
55 35 40 45 



Leu Asn Tyr Thr Arg Leu Lys His Glu Pro Thr Asn Ala Asp Met He 
50 55 60 
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His Leu Asp Arg Glu Asp Pro Asn Ser Val Phe Ser lie Gly Phe Gin 
65 70 75 80 

5 

Thr Pro Ala Glu Asn Asp Glu Gly lie Pro His lie Leu Glu His Thr 
85 90 95 

10 

Thr Leu Cys Gly Ser Asn Lys Tyr Pro Val Arg Asp Pro Phe Phe Lys 
100 105 110 



15 Met Leu Asn Arg Ser Leu Ala Thr Phe Met Asn Ala Phe Thr Ala Ser 
115 120 125 



Asp Phe Thr Phe Tyr Pro Phe Ala Thr Val Asn Thr Thr Asp Tyr Lys 
20 130 135 140 



Asn Leu Arg Asp Val Tyr Leu Asp Ala Thr Leu Phe Pro Lys Leu Arg 
145 150 155 160 

25 

Lys Leu Asp Phe Leu Gin Glu Gly Trp Arg Phe Glu His Ala Asp Val 
165 170 175 

30 

Asn Asp Lys Lys Ser Pro lie lie Phe Asn Gly Val Val Tyr Asn Glu 
180 185 190 



35 Met Lys Gly Gin Val Ser Asp Ser Ser Tyr lie Phe Tyr Met Leu Phe 
195 200 205 



Gin Gin His Leu Phe Gin Gly Thr Ala Tyr Gly Phe Asn Ser Gly Gly 
40 210 215 220 



Asp Pro Leu Ala He Pro Asp Leu Lys Tyr Glu Glu Leu Val Lys Phe 
225 230 235 240 

45 

His Arg Ser His Tyr His Pro Ser Asn Ala Lys He Leu Ser Tyr Gly 
245 250 255 

50 

Ser Phe Pro Leu Glu Asp Asn Leu Ser Ala Leu Ser Glu Thr Phe Arg 
260 265 270 

Pro Phe Ser Lys Arg Glu Leu Asn Leu Pro Asn Thr Phe Leu Lys Glu 
275 280 285 

55 



Phe Asp Gin Glu Lys Arg Val Val Glu Tyr Gly Pro Leu Asp Pro Val 
290 295 300 
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Met Ala Pro Gly Arg Gin Val Lys Thr Ser He Ser Phe Leu Ala Asn 
305 310 315 320 

5 

Asp Thr Ser Asn Val Tyr Glu Thr Phe Ala Leu Lys Val Leu Ser Lys 
325 330 335 

10 Leu Cys Phe Asp Gly Phe Ser Ser Pro Phe Tyr Lys Ala Leu He Glu 
340 345 350 

Ser Gly Leu Gly Thr Asp Phe Ala Pro Asn Ser Gly Tyr Asp Ser Thr 
15 355 360 365 



20 



Thr Lys Arg Gly He Phe Ser Val Gly Leu Glu Gly Ala Ser Glu Glu 
370 375 380 



25 



Ser Leu Ala Lys He Glu Asn Leu Val Tyr Ser He Phe Asn Asp Leu 

385 390 395 400 

Ala Leu Lys Gly Phe Glu Asn Glu Lys Leu Glu Ala He Leu His Gin 

405 410 415 



30 Met Glu He Ser Leu Lys His Lys Ser Ala His Phe Gly He Gly Leu 
420 425 430 



Ala Gin Ser Leu Pro Phe Asn Trp Phe Asn Gly Ala Asp Pro Ala Asp 
35 435 440 



445 



40 



Trp Leu Ser Phe Asn Lys Gin He Glu Trp Leu Lys Gin Lys Asn Ser 
450 455 460 



45 



Asp Gly Lys Leu Phe Gin Lys Leu He Lys Lys Tyr He Leu Glu Asn 
465 470 475 480 

Lys Ser Arg Phe Val Phe Thr Met Leu Pro Ser Ser Thr Phe Pro Gin 
485 490 495 



50 Arg Leu Gin Glu Ala Glu Ala Lys Lys Leu Gin Glu Arg Thr Ser Lys 
500 505 510 

Leu Thr Asp Glu Asp He Ala Glu He Glu Lys Thr Ser Val Lys Leu 
515 520 525 



55 



Leu Glu Ala Gin Ser Thr Pro Ala Asp Thr Ser Cys Leu Pro Thr Leu 
530 535 540 
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Ser Val Ser Asp lie Pro Glu Thr lie Asp Glu Thr Lys Leu Lys Phe 
545 550 555 560 

5 Leu Asp lie Ala Gly Met Lys Ala Gin Trp Tyr Asp Leu Ala Ala Gly 

565 570 575 

Leu Thr Tyr He Arg Leu Leu Leu Pro Leu Lys Asn Phe Pro Glu Ser 
10 580 585 590 

Leu He Pro Tyr Leu Pro Val Tyr Cys Asp Ala Cys Leu Asn Leu Gly 
595 600 605 

Thr His Ser Glu Ser He Gly Asp Leu Glu His Gin He Arg Arq Tvr 
610 615 620 



20 



Thr Gly Gly He Ser He Ser Pro Ser Ala Val Thr Asn Asn Ser Asp 
625 630 635 640 



25 Val Ser Lys Tyr Glu Leu Gly He Ala He Ser Gly Tyr Ala Leu Asp 

645 650 655 

Lys Asn Val Gly Lys Leu Val Glu Leu He Asn Lys Ala Phe Trp Asn 
30 660 665 670 

Thr Asn Leu Ser Asn Thr Asp Lys Leu Ala He Met Leu Lys Thr Ser 
675 680 685 

35 

Val Ser Gly He Thr Asp Gly He Ala Glu Lys Gly His Ser Phe Ala 
690 695 700 

40 

Lys Val Ser Ser Ala Ser Gly Leu Thr Glu Lys Thr Ser He Thr Glu 
705 710 715 720 

45 Gin Leu Gly Gly Leu Thr Gin Val Lys Leu Leu Ser Gin Leu Ser Arg 

725 730 735 

Glu Glu Ser Phe Gly Pro Leu Val Glu Lys Leu Thr Ala He Arg Glu 
50 740 745 750 

He Leu Arg Gly Thr Ser Gly Phe Lys Ala Ala He Asn Ala Ser Pro 
755 760 765 

55 Thr Gin His Glu Val Val Glu Lys Ala Leu Gin Lys Phe Met Lys Ser 
770 775 780 



Arg Gly Val Asn Gin Gin Thr Gin Thr Lys Ser Thr Ser Lys Glu Arg 
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"785 790 795 800 

Asn Gly lie Asn Ser lie Lys Thr Tyr His Glu Leu Pro Phe Gin Thr 
5 805 810 815 

Tyr Phe Ala Ala Lys Ser Cys Leu Gly Val Pro Tyr Thr His Pro Asp 
820 825 830 

10 

Gly Ala Pro Leu Gin lie Leu Ser Ser Leu Leu Thr His Lys Tyr Leu 
835 840 845 

15 

His Gly Glu lie Arg Glu Lys Gly Gly Ala Tyr Gly Ala Gly Leu Ser 
850 855 860 

20 Tyr Ser Gly lie Asp Gly Val Leu Ser Phe Phe Thr Tyr Arg Asp Ser 
865 870 875 880 

Asp Pro 

25 



<210> 6 

<211> 973 

30 <212> PRT 

<213> Clostridium perfringens 

<400> 6 

35 Met Asn Phe Lys Glu Asn Asn He Tyr Ser Gly Phe Lys Leu Leu Asn 
15 10 15 

He Glu Asn Leu Asn Glu He Gly Gly Val Gly Leu Arg Phe Glu His 
20 25 30 

Glu Lys Thr Lys Ala Lys Leu He Lys He Leu Ser Glu Asp Asp Asn 
35 40 45 

45 

Lys Cys Phe Ala He Gly Phe Arg Thr Pro Pro Glu Asn Ser Thr Gly 
50 55 60 

50 Val Pro His He Leu Glu His Ser Val Leu Cys Gly Ser Arg Lys Phe 
65 70 75 80 

Asn Thr Lys Glu Pro Phe Val Glu Leu Leu Lys Gly Ser Leu Asn Thr 
55 85 90 95 



Phe Leu Asn Ala Met Thr Tyr Pro Asp Lys Thr He Tyr Pro Val Ala 
100 105 110 
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Ser Arg Asn Glu Lys Asp Phe Met Asn Leu Met Asp Val Tyr Leu Asp 
115 120 125 

5 

Ala Val Leu Tyr Pro Asn lie Tyr Lys His Lys Glu lie Phe Met Gin 
130 135 140 

10 

Glu Gly Trp His Tyr Tyr He Glu Asn Lys Glu Asp Glu Leu Lys Tyr 
145 150 155 160 



15 Asn Gly Val Val Tyr Asn Glu Met Lys Gly Ala Tyr Ser Ser Pro Asp 

165 170 175 



Ser He Leu Tyr Arg Lys He Pro Gin Thr He Tyr Pro Asp Thr Cys 
20 180 185 190 



Tyr Ala Leu Ser Ser Gly Gly Asp Pro Asp Glu He Pro Asn Leu Thr 
195 200 205 

25 

Tyr Glu Glu Phe Val Glu Phe His Lys Lys Tyr Tyr His Pro Ser Asn 
210 215 220 

30 

Ser Tyr He Phe Leu Tyr Gly Asn Gly Asp Thr Glu Lys Glu Leu Glu 
225 230 235 240 



35 Phe He Asn Glu Glu Tyr Leu Lys Asn Phe Glu Tyr Lys Glu He Asp 

245 250 255 



Ser Glu He Lys Glu Gin Lys Ser Phe Glu Ser Met Lys Glu Glu Ser 
40 260 265 270 



Phe Thr Tyr Gly He Ala Glu Ser Glu Asp Leu Asn His Lys Ser Tyr 
275 280 285 

45 

Tyr Ser Leu Asn Phe Val He Gly Asp Ala Thr Asp Gly Glu Lys Gly 
290 295 300 

Leu Ala Phe Asp Val Leu Ala Tyr Leu Leu Thr Arg Ser Thr Ala Ala 
50 305 310 315 320 



Pro Leu Lys Lys Ala Leu He Asp Ala Gly He Gly Lys Ala Val Ser 
325 330 335 

55 

Gly Asp Phe Asp Asn Ser Thr Lys Gin Ser Ala Phe Thr Val Leu Val 
340 345 350 
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Lys Asn Ala Glu Leu Asn Lys Glu Glu Glu Phe Lys Lys Val Val Met 
355 360 365 

5 

Asp Thr Leu Lys Asp Leu Val Glu Asn Gly He Asp Lys Glu Leu He 
370 375 380 

10 Glu Ala Ser He Asn Arg Val Glu Phe Glu Leu Arg Glu Gly Asp Tyr 
385 390 395 400 

Gly Ser Tyr Pro Asn Gly Leu He Tyr Tyr Leu Lys Val Met Asp Ser 
15 405 410 415 



20 



Trp Leu Tyr Asp Gly Asp Pro Tyr Val His Leu Glu Tyr Glu Lys Asn 
420 425 430 



25 



Leu Glu Lys He Lys Ser Ala Leu Thr Ser Asn Tyr Phe Glu Asp Leu 
435 440 445 



He Glu Arg Tyr Met He Asn Asn Thr His Ser Ser Leu Val Ser Leu 
450 455 460 



30 His Pro Glu Lys Gly He Asn Glu Lys Lys Ser Ala Glu Leu Lys Lys 

465 470 475 480 

Lys Leu Glu Glu He Lys Asn Ser Phe Asp Glu Lys Thr Leu Asn Glu 

35 485 490 495 



40 



He He Asp Asn Cys Lys Lys Leu Lys Glu Arg Gin Ser Thr Pro Asp 
500 505 510 



Lys Lys Glu Asp Leu Glu Ser He Pro Met Leu Ser Leu Glu Asp He 
515 520 525 



45 



50 



Asp Lys Glu Ala Thr Lys He Pro Thr Glu Glu Lys Glu He Asp Gly 
530 535 540 

He Thr Thr Leu His His Asp Phe His Thr Asn Lys He Asp Tyr Val 
545 550 555 560 



55 



Asn Phe Phe Phe Asn Thr Asn Ser Val Pro Glu Asp Leu He Pro Tyr 
565 570 575 

Val Gly Leu Leu Cys Asp He Leu Gly Lys Cys Gly Thr Glu Asn Tyr 
580 585 590 
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Asp Tyr Ser Lys Leu Ser Asn Ala lie Asn He Ser Thr Gly Gly He 
595 600 605 

5 Ser Phe Gly Ala He Thr Phe Ala Asn Leu Lys Lys Asn Asn Glu Phe 
610 615 620 

Arg Pro Tyr Leu Glu He Ser Tyr Lys Ala Leu Ser Ser Lys Thr Asn 
10 625 630 635 640 

Lys Ala He Glu Leu Val Asp Glu He Val Asn His Thr Asp Leu Asp 
645 650 655 

15 

Asp Met Asp Arg He Met Gin He He Arg Glu Lys Arg Ala Arg Leu 
660 665 670 

20 

Glu Gly Ala He Phe Asp Ser Gly His Arg He Ala Met Lys Lys Val 
675 680 685 

25 Leu Ser Tyr Ser Thr Asn Arg Gly Ala Tyr Asp Glu Lys He Ser Gly 
690 695 700 

Leu Asp Tyr Tyr Asp Phe Leu Val Asn He Glu Lys Glu Asp Lys Lys 
30 705 710 715 720 

Ser Thr He Ser Asp Ser Leu Lys Lys Val Arg Asp Leu He Phe Asn 
725 730 735 

35 

Lys Gly Asn Met Leu He Ser Tyr Ser Gly Lys Glu Glu Glu Tyr Glu 
740 745 750 

40 

Asn Phe Lys Glu Lys Val Lys Tyr Leu He Ser Lys Thr Asn Asn Asn 
755 760 765 

45 Asp Phe Glu Lys Glu Glu Tyr Asn Phe Glu Leu Gly Lys Lys Asn Glu 
770 775 780 

Gly Leu Leu Thr Gin Gly Asn Val Gin Tyr Val Ala Lys Gly Gly Asn 
785 790 795 800 

50 

Tyr Lys Thr His Gly Tyr Lys Tyr Ser Gly Ala Leu Ser Leu Leu Glu 
805 810 815 

55 Ser He Leu Gly Phe Asp Tyr Leu Trp Asn Ala Val Arg Val Lys Gly 
820 825 830 



Gly Ala Tyr Gly Val Phe Ser Asn Phe Arg Arg Asp Gly Gly Ala Tyr 
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835 840 845 

He Val Ser Tyr Arg Asp Pro Asn He Lys Ser Thr Leu Glu Ala Tyr 
5 850 855 860 

Asp Asn He Pro Lys Tyr Leu Asn Asp Phe Glu Ala Asp Glu Arg Glu 
865 870 875 880 

10 

Met Thr Lys Tyr He He Gly Thr He Arg Lys Tyr Asp Gin Pro He 
885 890 895 

15 

Ser Asn Gly He Lys Gly Asp He Ala Val Ser Tyr Tyr Leu Ser Asn 
900 905 910 

20 Phe Thr Tyr Glu Asp Leu Gin Lys Glu Arg Glu Glu He He Asn Ala 
915 920 925 

Asp Val Glu Lys He Lys Ser Phe Ala Pro Met He Lys Asp Leu Met 
25 930 935 940 

Lys Glu Asp Tyr He Cys Val Leu Gly Asn Glu Glu Lys He Lys Glu 
945 950 955 960 

30 

Asn Lys Asp Leu Phe Asn Asn He Lys Ser Val He Lys 
965 970 

35 

<210> 7 
<211> 971 
<212> PRT 

<213> Borrelia burgdorferi 

40 

<400> 7 

Met Lys Lys Lys Lys He Phe Lys Leu He Ser Lys Thr Tyr Leu Glu 
15 10 15 

45 

Glu His Asp Ala Glu Gly Tyr Tyr Phe Lys His Glu Ser Gly Leu Glu 
20 25 30 

50 Val Phe His Leu Lys Ser Asp Ser Phe Lys Glu Asn Ala Phe Cys He 
35 40 45 

Ala Phe Lys Thr He Pro Ser Asn Asn Thr Gly Val Ala His Val Leu 
55 50 55 60 



Glu His Thr He Phe Cys Gly Ser Ser Lys Tyr Lys He Lys Asp Pro 
65 70 75 80 
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Phe Leu Tyr Leu Leu Lys Gly Ser Leu Asn Thr Phe Leu Asn Ala Met 
85 90 95 

5 

Thr Phe Pro Asp Lys Thr lie Tyr Pro Ala Ala Ser Thr lie Glu Lys 
100 105 110 

10 

Asp Tyr Phe Asn Leu Phe Asn He Tyr Ala Asp Ser He Phe Asn Pro 
115 120 125 

15 Leu Leu Lys Lys Glu Ser Phe Met Gin Glu Gly Tyr Asn He Asn Pro 
130 135 140 



Lys Asp Phe Lys Val Ser Gly He Val Phe Asn Glu Met Lys Gly Ser 
20 145 150 155 160 



Tyr Ser Asn Lys Asn Ser Leu He Asn Glu He Val Ser Ser Ser Leu 
165 170 175 

25 

Phe Glu Glu Gly Ala Tyr Lys Tyr Asp Ser Gly Gly He Pro Thr Asn 
180 185 190 

30 

He He Asp Leu Thr Tyr Glu Ser Phe Leu Asp Phe Tyr Lys Lys Tyr 
195 200 205 

35 Tyr Thr Leu Glu Asn Cys Lys He Phe Leu Cys Gly Asn Thr Gin Thr 
210 215 220 



Glu Lys Asn Leu Asn Phe He Glu Lys Tyr He He Arg Pro Tyr Lys 
40 225 230 235 240 



Lys Glu Lys Ser Asn Val Asn He Asn He Glu Asn Val Lys Arg Trp 
245 250 255 

45 Glu Lys Gly Lys Lys Leu Thr Tyr Lys He Pro Lys Glu Asn Asp Asn 
260 265 270 



Ser Leu Gly Val Tyr Thr He Asn Trp Leu Cys Thr Glu He Asn Asn 
50 275 280 285 



He Glu Asp Ser He Gly Leu Glu He Leu Ser Glu He Leu Leu Asp 
290 295 300 

55 



Asp Ser Cys Ser Phe Thr He Asn He Leu Lys Ser Gly He Gly Glu 
305 310 315 320 
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Asp lie Ala His lie Ser Gly lie Asn Thr Asp Leu Lys Glu Ser lie 
325 330 335 

5 

Phe Ser Phe Gly Leu Gin Asn Val Val Glu Asn Lys Glu Lys Glu Phe 
340 345 350 

10 Lys Asn Leu Val Phe Ser Glu Leu Lys Asn Leu Val Lys Asn Lys lie 
355 360 365 

Pro Lys Glu Leu lie Lys Gly He Leu Phe Gly Tyr Glu Phe Ala Leu 
15 370 375 380 

Lys Glu Glu Lys Gly Gin Asn Phe Pro He Ala Leu Met He Lys Ser 
385 390 395 400 

20 

Phe Lys Gly Trp Leu Asn Gly Leu His Pro He Lys Thr Leu Gin Thr 
405 410 415 

25 

Ser Tyr Tyr He Asn Glu He Thr Asn Lys Leu Glu Lys Gly He Tyr 
420 425 430 

30 Tyr Phe Glu Asn Leu He Glu Lys Tyr Leu He Phe Asn Asn His Tyr 
435 440 445 

Thr Leu He Ser Phe He Pro Ser His Asp Thr Glu Lys Glu Met Glu 
35 450 455 4 60 

Glu Glu He Glu Lys Lys Leu Met Ala Arg Glu He Glu He Lys Gin 
465 470 475 480 

40 

Asn Pro Glu Glu Phe Leu Gin Phe Lys Lys Asp Tyr Asn Gin Phe Lys 
485 490 495 

Lys Tyr Gin Asn Lys Lys Asp Ser Lys Ala Asp He Ala Lys Leu Pro 
45 500 505 510 

Leu Leu Lys He Glu Asp Leu Pro Lys Gin He Glu Lys Ser Leu Asp 
515 520 525 

50 

Leu Asn Glu He Lys Glu Leu Asn Leu His Ser Phe Lys Phe Lys Ser 
530 535 540 



55 



Asn Asn He Phe Asn Val Asn Leu Phe Phe Lys Leu Asp Phe Leu Glu 
545 550 555 560 
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Lys Glu Asp Tyr lie Tyr Leu Ser Leu Phe Lys Arg Ala Leu Gin Asp 
565 570 575 



5 Leu Ser Thr Lys Asn Tyr Ser Tyr lie Asn lie Asn Asn Lys lie Gin 
580 585 590 



Asn Thr Leu Gly Gin lie Asn lie Ser Glu Ser Tyr Asp Glu Asp lie 
10 595 600 605 



Asp Gly Asn lie Leu Asn Ser Phe Asn lie Ser Phe Lys Ser Phe Asn 
610 615 620 

15 

Asn Lys Val Lys Glu Ser Phe Glu Leu lie Lys Glu lie Leu lie Asn 
625 630 635 640 

20 

lie Asn Phe His Asp Tyr Glu Arg Leu Lys Glu lie Thr Leu Ser Leu 
645 650 655 



25 Lys Asn Asp Phe Lys Ser Leu Leu lie Pro Lys Gly His Leu Leu Ala 
660 665 670 



Met Leu Arg Ser Lys Ser Lys Leu Lys Leu Asn Glu Tyr Leu Lys Glu 
30 675 680 685 



Leu Gin Asn Gly lie Thr Gly Arg Glu Phe Trp Gin Lys Ala Lys Thr 
690 695 700 

35 

Asp Thr Glu Ser Leu Lys Glu lie Ala Asn Lys Leu Asp Asn Leu Lys 
705 710 715 720 

40 

Asn Lys He He Leu Lys Asn Asn Leu Ser Ala Leu He Met Gly Asn 
725 730 735 

Thr Asp Asp He Leu Lys Asn Leu Glu Asn Glu Phe Phe Asn Leu Lys 
740 745 750 

45 

Glu Ser Leu Glu Glu Ser Asn His Tyr Asn Gly Leu Leu Asn Leu Asp 
755 760 765 

50 

Ala Asn Ser Lys Ala Leu Arg Glu He He He He Gin Ser Lys Val 
770 775 780 



55 Ala Phe Asn Ala He Cys Phe Pro Ser Tyr Lys He Asn Asp Glu Asn 
785 790 795 800 



Tyr Pro Lys Ala Asn Phe Leu Glu His Val Leu Arg Ser Gly He Phe 
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805 810 815 



Trp Glu Lys lie Arg Val Met Gly Gly Ala Tyr Gly Ala Ser Ala Ser 
5 820 825 830 



lie Ala Asn Gly He Phe Ser Phe Ala Ser Tyr Arg Asp Pro Asn Phe 
835 840 845 

10 

Thr Lys Thr Tyr Gin Ala Phe Glu Lys Ser Leu Glu Glu Leu Ala Asn 
850 855 860 

15 

Asn Lys Met Thr Asp Asp Glu He Tyr Thr Tyr Leu He Gly Leu He 
865 870 875 880 



20 Gly Thr Asn He Tyr Val Lys Thr Lys Ala Thr Glu Ala Leu Gin Ser 

885 890 895 



Tyr Arg Arg Lys Met Leu Asn He Ser Asp Ser Leu Arg Gin Asp He 
25 900 905 910 



Arg Asn Ala Tyr Phe Thr He Thr Pro Gin Asp He Lys Glu He Ser 
915 920 925 

30 

Thr Lys He Leu Thr Gin He Arg Gin His Asn Ser He Ala Ser Leu 
930 935 940 

35 

Val Asn Asn Gin He Tyr Glu Glu Glu Lys Asn Asn Leu Glu Lys Leu 
945 950 955 960 



40 He Gly Lys Glu Tyr Ser Leu Lys Lys He Tyr 

965 970 

<210> 8 
<211> 995 
<212> PRT 
45 <213> Caenorhabditis elegans 

<400> 8 

Met Ser Ala Ser Lys Leu Trp Ser Cys Thr Glu Thr Val Leu Asn Gly 
50 1 5 10 15 



Gly He Lys Leu Phe Leu Tyr Ser Ser Lys Asn Thr Lys Leu Arg Val 
20 25 30 

55 



Ala He Gly Glu Val Pro Gly Pro Met Val. His Gly Ala Val Ser Phe 
35 40 45 
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Val Thr Glu Ala Asp Ser Asp Asp Gly Leu Pro His Thr Leu Glu His 
50 55 60 

5 

Leu Val Phe Met Gly Ser Lys Lys Tyr Pro Phe Lys Gly Val Leu Asp 
65 70 75 80 

10 Val lie Ala Asn Arg Cys Leu Ala Asp Gly Thr Asn Ala Trp Thr Asp 

85 90 95 

Thr Asp His Thr Ala Tyr Thr Leu Ser Thr Val Gly Ser Asp Gly Phe 
15 100 105 110 

Leu Lys Val Leu Pro Val Tyr lie Asn His Leu Leu Thr Pro Met Leu 
115 120 125 

20 

Thr Ala Ser Gin Phe Ala Thr Glu Val His His lie Thr Gly Glu Gly 
130 135 140 

25 

Asn Asp Ala Gly Val Val Tyr Ser Glu Met Gin Asp His Glu Ser Glu 
145 150 155 160 

30 Met Glu Ser lie Met Asp Arg Lys Thr Lys Glu Val lie Tyr Pro Pro 

165 170 175 

Phe Asn Pro Tyr Ala Val Asp Thr Gly Gly Arg Leu Lys Asn Leu Arg 
35 180 185 190 

Glu Ser Cys Thr Leu Glu Lys Val Arg Asp Tyr His Lys Lys Phe Tyr 
195 200 205 

40 

His Leu Ser Asn Met Val Val Thr Val Cys Gly Met Val Asp His Asp 
210 215 220 

45 Gin Val Leu Glu lie Met Asn Asn Val Glu Asn Glu His Met Ser Thr 
225 230 235 240 

Val Pro Asp His Phe Pro Lys Pro Phe Ser Phe Ala Leu Ser Asp lie 
50 245 250 255 

Lys Glu Ser Thr Val His Arg Val Glu Cys Pro Thr Asp Asp Ala Ser 
260 265 270 



Arg Gly Ala Val Glu Val Ala Trp Phe Ala His Ser Pro Ser Asp Leu 
275 280 285 
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Glu Thr His Ser Ser Leu His Val Leu Phe Asp Tyr Leu Ser Asn Thr 
290 295 300 

5 

Ser Val Ala Pro Leu Gin Lys Asp Phe lie Leu Leu Glu Asp Pro Leu 
305 310 315 320 



10 Ala Ser Ser Val Ser Phe His lie Ala Glu Gly Val Arg Cys Asp Leu 

325 330 335 



Arg Leu Asn Phe Ala Gly Val Pro Val Glu Lys Leu Asp Glu Cys Ala 
15 340 345 350 



Pro Lys Phe Phe Asp Lys Thr Val Arg Glu His Leu Glu Glu Ala Asn 
355 360 365 

20 

Phe Asp Met Glu Arg Met Gly Tyr Leu lie Asp Gin Thr lie Leu Asn 
370 375 380 

25 

Glu Leu Val Lys Leu Glu Thr Asn Ala Pro Lys Asp lie Met Ser His 
385 390 395 400 



30 lie lie Gly His Gin Leu Phe Asp Asn Glu Asp Glu Glu Leu Phe Lys 

405 410 415 



Lys Arg Thr Asn Glu lie Asp Phe Leu Lys Lys Leu Lys Ser Glu Pro 
35 420 425 430 



Ala Ser Tyr Trp Val Gin Leu Val Asn Lys Tyr Phe Thr Ala Pro Ser 
435 440 445 

40 Ala Thr Val lie Gly Val Pro Asn Glu Glu Leu Val Asp Lys lie Ala 
450 455 460 



Glu Glu Glu Glu Lys Arg lie Ala Ala Gin Cys Glu Lys Leu Gly Lys 
45 465 470 475 480 



Lys Gly Leu Glu Glu Ala Gly Lys Ser Leu Glu Ala Ala lie Leu Glu 
485 490 495 

50 

Asn Thr Ala Asn His Pro Ser Ala Glu Leu Leu Asp Gin Leu lie Val 
500 505 510 

55 

Lys Asp Leu Glu Ala Phe Asp Arg Phe Pro Val Gin Ser Leu Thr Ser 
515 520 525 
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Asn Ser Pro Ser Leu Thr Pro Gin Gin Ser Thr Phe Leu Ala Gin Phe 
530 535 540 

5 Pro Phe His Ala Asn Leu His Asn Cys Pro Thr Lys Phe Val Glu He 

550 555 560 

Phe Phe Leu Leu Asp Ser Ser Asn Leu Ser He Glu Asp Arg Ser Tyr 
10 565 570 575 

Leu Phe Leu Tyr Thr Asp Leu Leu Phe Glu Ser Pro Ala Met He Asp 
580 585 590 

Gly Val Leu Thr Ser Ala Asp Asp Val" Ala Lys His Phe Thr Lys Asp 
595 600 605 



20 



Leu He Asp His Ser He Gin Val Gly Val Ser Gly Leu Tyr Asp Arq 
610 615 620 ^ y 



25 Phe Val Asn Leu Arg He Lys Val Gly Ala Asp Lys Tyr Pro Leu Leu 
"5 630 635 640 

Ala Lys Trp Ala Gin He Phe Thr Gin Gly Val Val Phe Asp Pro Ser 
30 645 650 655 

Arg He His Gin Cys Ala Gin Lys Leu Ala Gly Glu Ala Arg Asp Arg 
660 665 670 

35 

Lys Arg Asp Gly Cys Thr Val Ala Ser Thr Ala Val Ala Ser Met Val 
675 680 685 

Tyr Gly Lys Asn Thr Asn Cys He Leu Phe Asp Glu Leu Val Leu Glu 
40 690 695 700 

Lys Leu His Glu Lys He Ser Lys Asp Val Met Lys Asn Pro Glu Ala 
705 710 715 720 

45 

Val Leu Glu Lys Leu Glu Gin Val Arg Ser Ala Leu Phe Ser Asn Gly 
725 730 735 

50 

Val Asn Ala His Phe Val Ala Asp Val Asp Ser He Asp Pro Lys Met 
740 745 750 

55 Leu Ser Ser Asp Leu Trp Thr Trp Val Gin Ala Asp Pro Arg Phe Gly 
755 760 765 



Pro Gly His Gin Phe Ser Ala Glu Ala Gly Glu Asn Val Ser Leu Glu 
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775 780 

Leu Gly Lys Glu Leu Leu He Gly Val Gly Gly Sex Glu Ser Ser Phe 

^ 790 795 800 

He Tyr Gin Thr Ser Phe Leu Asp Ala Asn Trp Asn Ser Glu Glu Leu 
10 810 815 

He Pro Ala Met He Phe Gly Gin Tyr Leu Ser Gin Cys Glu Gly Pro 

825 830 



15 



Leu Trp Arg Ala He Arg Gly Asp Gly Leu Ala Tyr Gly Ala Asn Val 
835 840 845 



20 Phe Val Lys Pro Asp Arg Lys Gin He Thr Leu Ser Leu Tyr Arg Cys 
850 855 860 ^ -=> ^ 

25 ^et ""^^ ^''^ ""^^ ■'^^^ Asp He He Arg Lys 

870 875 880 

He Val Glu Ser Gly Glu He Ser Lys Ala Glu Phe Glu Gly Ala Lys 
30 890 895 

Arg Ser Thr Val Phe Glu Met Met Lys Arg Glu Gly Thr Val Ser Gly 
900 905 910 

35 

Ala Ala Lys He Ser He Leu Asn Asn Phe Arg Gin Thr Pro His Pro 

920 325 
Phe Asn He Asp Leu Cys Arg Arg He Trp Asn Leu Thr Ser Glu Glu 
40 ^35 940 

Met val Lys He Gly Gly Pro Pro Leu Ala Arg Leu Phe Asp Glu Lys 

®50 955 



45 



Cys Phe Val Arg Ser He Ala Val His Pro Ser Lys Leu Asn Glu Met 
965 970 



50 Lys Lys Ala Phe Pro Gly Ser Ser Lys He Lys He Ser Asp Leu Gin 
380 985 990 

Phe Ala Cys 
55 995 



<210> 
<211> 



9 

962 
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<212> PRT 

<213> Escherichia coli 
<400> 9 

5 

Met Pro Arg Ser Thr Trp Phe Lys Ala Leu Leu Leu Leu Val Ala Leu 
15 10 15 

10 Trp Ala Pro Leu Ser Gin Ala Glu Thr Gly Trp Gin Pro lie Gin Glu 
20 25 30 



Thr He Arg Lys Ser Asp Lys Asp Asn Arg Gin Tyr Gin Ala He Arg 
15 35 40 45 

Leu Asp Asn Gly Met Val Val Leu Leu Val Ser Asp Pro Gin Ala Val 
50 55 60 

20 

Lys Ser Leu Ser Ala Leu Val Val Pro Val Gly Ser Leu Glu Asp Pro 
65 70 75 80 

25 

Glu Ala Tyr Gin Gly Leu Ala His Tyr Leu Glu His Met Ser Leu Met 
85 90 95 



30 Gly Ser Lys Lys Tyr Pro Gin Ala Asp Ser Leu Ala Glu Tyr Leu Lys 
100 105 110 



Met His Gly Gly Ser His Asn Ala Ser Thr Ala Pro Tyr Arg Thr Ala 
35 115 120 125 

Phe Tyr Leu Glu Val Glu Asn Asp Ala Leu Pro Gly Ala Val Asp Arg 
130 135 140 

40 

Leu Ala Asp Ala He Ala Glu Pro Leu Leu Asp Lys Lys Tyr Ala Glu 
145 150 155 160 

45 Arg Glu Arg Asn Ala Val Asn Ala Glu Leu Thr Met Ala Arg Thr Arg 

165 170 175 

Asp Gly Met Arg Met Ala Gin Val Ser Ala Glu Thr He Asn Pro Ala 
50 180 185 190 



His Pro Gly Ser Lys Phe Ser Gly Gly Asn Leu Glu Thr Leu Ser Asp 
195 200 205 

55 

Lys Pro Gly Asn Pro Val Gin Gin Ala Leu Lys Asp Phe His Glu Lys 
210 215 220 
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Tyr Tyr Ser Ala Asn Leu Met Lys Ala Val He Tyr Ser Asn Lys Pro 
225 230 235 240 

5 

Leu Pro Glu Leu Ala Lys Met Ala Ala Asp Thr Phe Gly Arg Val Pro 
245 250 255 

10 Asn Lys Glu Ser Lys Lys Pro Glu He Thr Val Pro Val Val Thr Asp 
260 265 270 



15 



Ala Gin Lys Gly He He He His Tyr Val Pro Ala Leu Pro Arg Lys 



275 



280 



285 



20 



Val Leu Arg Val Glu Phe Arg He Asp Asn Asn Ser Ala Lys Phe Ara 
290 295 300 



25 



Ser Lys Thr Asp Glu Leu He Thr Tyr Leu He Gly Asn Arg Ser Pro 
305 . 310 315 320 

Gly Thr Leu Ser Asp Trp Leu Gin Lys Gin Gly Leu Val Glu Gly He 
325 330 335 



30 Ser Ala Asn Ser Asp Pro He Val Asn Gly Asn Ser Gly Val Leu Ala 
340 345 350 

He Ser Ala Ser Leu Thr Asp Lys Gly Leu Ala Asn Arg Asp Gin Val 
35 355 360 365 

Val Ala Ala He Phe Ser Tyr Leu Asn Leu Leu Arg Glu Lys Glv He 
370 375 380 



40 Asp Lys Gin Tyr Phe Asp Glu Leu Ala Asn Val Leu Asp He Asp Phe 
385 390 395 



400 



45 



Arg Tyr Pro Ser He Thr Arg Asp Met Asp Tyr Val Glu Trp Leu Ala 



405 



410 



415 



50 



Asp Thr Met He Arg Val Pro Val Glu His Thr Leu Asp Ala Val Asn 
420 425 430 



55 



He Ala Asp Arg Tyr Asp Ala Lys Ala Val Lys Glu Arg Leu Ala Met 
435 440 445 

Met Thr Pro Gin Asn Ala Arg He Trp Tyr He Ser Pro Lys Glu Pro 
450 455 460 
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His Asn Lys Thr Ala Tyr Phe Val Asp Ala Pro Tyr Gin Val Asp Lys 
465 470 475 480 

5 He Ser Ala Gin Thr Phe Ala Asp Trp Gin Lys Lys Ala Ala Asp He 

485 490 495 

Ala Leu Ser Leu Pro Glu Leu Asn Pro Tyr He Pro Asp Asp Phe Ser 
10 500 505 510 

Leu He Lys Ser Glu Lys Lys Tyr Asp His Pro Glu Leu He Val Asp 
515 520 525 

15 

Glu Ser Asn Leu Arg Val Val Tyr Ala Pro Ser Arg Tyr Phe Ala Ser 
530 535 540 

20 

Glu Pro Lys Ala Asp Val Ser Leu He Leu Arg Asn Pro Lys Ala Met 
545 550 555 560 

25 Asp Ser Ala Arg Asn Gin Val Met Phe Ala Leu Asn Asp Tyr Leu Ala 

565 570 575 

Gly Leu Ala Leu Asp Gin Leu Ser Asn Gin Ala Ser Val Gly Gly He 
30 580 585 590 

Ser Phe Ser Thr Asn Ala Asn Asn Gly Leu Met Val Asn Ala Asn Gly 
595 600 605 

35 Tyr Thr Gin Arg Leu Pro Gin Leu Phe Gin Ala Leu Leu Glu Gly Tyr 
610 615 620 

Phe Ser Tyr Thr Ala Thr Glu Asp Gin Leu Glu Gin Ala Lys Ser Trp 
40 625 630 635 640 

Tyr Asn Gin Met Met Asp Ser Ala Glu Lys Gly Lys Ala Phe Glu Gin 
645 650 655 

45 

Ala He Met Pro Ala Gin Met Leu Ser Gin Val Pro Tyr Phe Ser Arg 
660 665 670 

50 

Asp Glu Arg Arg Lys He Leu Pro Ser He Thr Leu Lys Glu Val Leu 
675 680 685 

55 Ala Tyr Arg Asp Ala Leu Lys Ser Gly Ala Arg Pro Glu Phe Met Val 
690 695 700 



He Gly Asn Met Thr Glu Ala Gin Ala Thr Thr Leu Ala Arg Asp Val 
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705 710 715 720 

Gin Lys Gin Leu Gly Ala Asp Gly Ser Glu Trp Cys Arg Asn Lys Asp 
5 725 730 735 

Val Val Val Asp Lys Lys Gin Ser Val He Phe Glu Lys Ala Gly Asn 
740 745 750 

10 

Ser Thr Asp Ser Ala Leu Ala Ala Val Phe Val Pro Thr Gly Tyr Asp 
755 760 765 

15 

Glu Tyr Thr Ser Ser Ala Tyr Ser Ser Leu Leu Gly Gin He Val Gin 
, ^ 770 775 780 

20 Pro Trp Phe Tyr Asn Gin Leu Arg Thr Glu Glu Gin Leu Gly Tyr Ala 
785 790 795 800 

Val Phe Ala Phe Pro Met Ser Val Gly Arg Gin Trp Gly Met Gly Phe 
25 805 810 815 

Leu Leu Gin Ser Asn Asp Lys Gin Pro Ser Phe Leu Trp Glu Arg Tyr 
820 825 830 

30 

Lys Ala Phe Phe Pro Thr Ala Glu Ala Lys Leu Arg Ala Met Lys Pro 
835 840 845 

Asp Glu Phe Ala Gin He Gin Gin Ala Val He Thr Gin Met Leu Gin 
35 850 855 860 

Ala Pro Gin Thr Leu Gly Glu Glu Ala Ser Lys Leu Ser Lys Asp Phe 
865 870 875 880 

40 

Asp Arg Gly Asn Met Arg Phe Asp Ser Arg Asp Lys He Val Ala Gin 
885 890 895 

45 

He Lys Leu Leu Thr Pro Gin Lys Leu Ala Asp Phe Phe His Gin Ala 
900 905 910 

50 Val Val Glu Pro Gin Gly Met Ala He Leu Ser Gin He Ser Gly Ser 
915 920 925 

Gin Asn Gly Lys Ala Glu Tyr Val His Pro Glu Gly Trp Lys Val Trp 
55 930 935 940 



Glu Asn Val Ser Ala Leu Gin Gin Thr Met Pro Leu Met Ser Glu Lys 
945 950 955 960 
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Asn Glu 



<210> 10 

<211> 1161 

<212> PRT 

10 <213> Homo sapiens 

<400> 10 

Met Leu Arg Arg Val Ala Val Ala Ala Val Phe Ala Thr Gly Arg Lys 
15 1 5 10 15 

Leu Arg Cys Glu Ala Gly Arg Asp Val Thr Ala Val Gly Arg lie Glu 
20 25 30 

20 

Ala Arg Gly Leu Cys Glu Glu Ser Ala Lys Pro Phe Pro Thr Leu Thr 
35 40 45 

25 

Met Pro Gly Arg Asn Lys Ala Lys Ser Thr Cys Ser Cys Pro Asp Leu 
50 55 60 

30 Gin Pro Asn Gly Gin Asp Leu Gly Glu Ser Gly Arg Val Ala Arg Leu 
65 70 75 80 

Gly Ala Asp Glu Ser Glu Glu Glu Gly Arg Ser Leu Ser Asn Val Gly 
85 90 95 

35 

Asp Pro Glu lie lie Lys Ser Pro Ser Asp Pro Lys Gin Tyr Arg Tyr 
100 105 110 

40 He Lys Leu Gin Asn Gly Leu Gin Ala Leu Leu He Ser Asp Leu Ser 
115 120 125 

Asn Val Glu Gly Lys Thr Gly Asn Ala Thr Asp Glu Glu Glu Glu Glu 
45 130 135 140 

Glu Glu Glu Glu Glu Glu Gly Glu Glu Glu Glu Glu Glu Glu Glu Asp 
145 150 155 160 

50 

Asp Asp Asp Asp Asp Asp Glu Asp Ser Gly Ala Glu He Gin Asp Asp 
165 170 175 

55 

Asp Glu Glu Gly Phe Asp Asp Glu Glu Glu Phe Asp Asp Asp Glu His 
180 185 190 
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Asp Asp Asp Asp Leu Asp Asn Glu Glu Asn Glu Leu Glu Glu Leu Glu 
195 200 205 

5 Glu Arg Val Glu Ala Axg Lys Lys Thr Thr Glu Lys Gin Ser Ala Ala 
210 215 220 

^ic Ala Asp Pro Asp Asp Leu Pro 

^° 225 230 235 240 

Gly Leu Ala His Phe Leu Glu His Met Val Phe Met Gly Ser Leu Lys 
^5 245 250 255 

Tyr Pro Asp Glu Asn Gly Phe Asp Ala Phe Leu Lys Lys His Gly Glv 
260 265 270 



20 



Ser Asp Asn Ala Ser Thr Asp Cys Glu Arg Thr Val Phe Gin Phe Asp 
275 280 285 



25 Val Gin Arg Lys Tyr Phe Lys Glu Ala Leu Asp Arg Trp Ala Gin Phe 
290 295 300 

■=n l^t ^""^ ^9 Asp Ala lie Asp Arg Glu Val Glu 

30 305 310 315 320 

Ala Val Asp Ser Glu Tyr Gin Leu Ala Arg Pro Ser Asp Ala Asn Arg 
325 330 335 

35 Lys Glu Met Leu Phe Gly Ser Leu Ala Arg Pro Gly His Pro Met Gly 
340 345 350 

Lys Phe Phe Trp Gly Asn Ala Glu Thr Leu Lys His Glu Pro Lys Lys 
40 355 360 365 

Asn Asn lie Asp Thr His Ala Arg Leu Arg Glu Phe Trp Met Arg Tvr 
370 375 380 

45 

Tyr Ser Ala His Tyr Met Thr Leu Val Val Gin Ser Lys Glu Thr Leu 

390 395 400 

50 

Asp Thr Leu Glu Lys Trp Val Thr Glu lie Phe Ser Gin lie Pro Asn 
405 410 415 

55 Asn Gly Leu Pro Lys Pro Asn Phe Ser His Leu Thr Asp Pro Phe Asp 
420 425 430 



Ala Phe. Asn Lys Leu Tyr Arg Val Val Pro lie Arg Lys He 
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435 440 445 

His Ala Leu Thr lie Thr Trp Ala Leu Pro Pro Gin Gin Gin His Tyr 
5 450 455 460 

Arg Val Lys Pro Leu His Tyr lie Ser Trp Leu Val Gly His Glu Gly 
465 470 475 480 

10 

Lys Gly Ser lie Leu Ser Tyr Leu Arg Lys Lys Cys Trp Ala Leu Ala 
485 490 495 

15 

Leu Phe Gly Gly Asn Gly Glu Thr Gly Phe Glu Gin Asn Ser Thr Tyr 
500 505 510 

20 Ser Val Phe Ser lie Ser He Thr Leu Thr Asp Glu Gly Tyr Glu His 
515 520 525 

Phe Tyr Glu Val Ala His Thr Val Phe Gin Tyr Leu Lys Met Leu Gin 
25 530 535 540 

Lys Leu Gly Pro Glu Lys Arg Val Phe Glu Glu He Gin Lys He Glu 
545 550 555 560 

30 Asp Asn Glu Phe His Tyr Gin Glu Gin Thr Asp Pro Val Glu Tyr Val 

565 570 575 

Glu Asn Met Cys Glu Asn Met Gin Leu Tyr Pro Arg Gin Asp Phe Leu 
35 580 585 590 

Thr Gly Asp Gin Leu Leu Phe Glu Tyr Lys Pro Glu Val He Ala Glu 
595 600 605 

40 

Ala Leu Asn Gin Leu Val Pro Gin Lys Ala Asn Leu Val Leu Leu Ser 
610 615 620 

45 

Gly Ala Asn Glu Gly Arg Cys Asp Leu Lys Glu Lys Trp Phe Gly Thr 
625 630 635 640 

50 Gin Tyr Ser He Glu Asp He Glu Asn Ser Trp Thr Glu Leu Trp Lys 

645 650 655 

Ser Asn Phe Asp Leu Asn Ser Asp Leu His Leu Pro Ala Glu Asn Lys 
55 660 665 670 



Tyr He Ala Thr Asp Phe Thr Leu Lys Ala Phe Asp Cys Pro Glu Thr 
675 680 685 
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Glu Tyr Pro Ala Lys lie Val Asn Thr Pro Gin Gly Cys Leu Trp Tvr 
^ 690 695 700 

Lys Lys Asp Asn Lys Phe Lys He Pro Lys Ala Tyr He Arg Phe His 
705 710 715 720 

10 

Leu He Ser Pro Leu He Gin Lys Ser Ala Ala Asn Val Val Leu Phe 
725 730 735 

15 Asp He Phe Val Asn He Leu Thr His Asn Leu Ala Glu Pro Ala Tyr 
740 745 750 

Glu Ala Asp Val Ala Gin Leu Glu Tyr Lys Leu Val Ala Gly Glu His 
20 755 760 765 

Gly Leu He He Arg Val Lys Gly Phe Asn His Lys Leu Pro Leu Leu 
770 775 780 

25 

Phe Gin Leu He He Asp Tyr Leu Thr Glu Phe Ser Ser Thr Pro Ala 
785 790 795 800 

Val Phe Thr Met He Thr Glu Gin Leu Lys Lys Thr Tyr Phe Asn He 
30 805 810 815 

Leu He Lys Pro Glu Thr Leu Ala Lys Asp Val Arg Leu Leu He Leu 
820 825 830 

35 

Glu Tyr Ser Arg Trp Ser Met He Asp Lys Tyr Arg Ala Leu Met Asp 
835 840 845 

40 

Gly Leu Ser Leu Glu Ser Leu Leu Asn Phe Val Lys Asp Phe Lys Ser 
850 855 860 

45 Gin Leu Phe Val Glu Gly Leu Val Gin Gly Asn Val Thr Ser Thr Glu 
865 870 875 880 

Ser Met Asp Phe Leu Arg Tyr Val Val Asp Lys Leu Asn Phe Val Pro 
^0 885 890 895 

Leu Glu Arg Glu Met Pro Val Gin Phe Gin Val Val Glu Leu Pro Ser 
900 905 910 



Gly His His Leu Cys Lys Val Arg Ala Leu Asn Lys Gly Asp Ala Asn 
915 920 925 
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Ser Glu Val Thr Val Tyr Tyr Gin Ser Gly Thr Arg Ser Leu Arg Glu 
930 935 940 

5 

Tyr Thr Leu Met Glu Leu Leu Val Met His Met Glu Glu Pro Cvs Phe 

950 955 960 

10 Asp Phe Leu Arg Thr Lys Gin Thr Leu Gly Tyr His Val Tyr Pro Thr 

965 970 975 

Cys Arg Asn Thr Ser Gly He Leu Gly Phe Ser Val Thr Val Gly Thr 
" 980 985 990 

Gin Ala Thr Lys Tyr Asn Ser Glu Thr Val Asp Lys Lys He Glu Glu 
2Q 1000 1005 



Phe Leu Ser Ser Phe Glu Glu Lys He Glu Asn Leu Thr Glu Asp 
1010 1015 1020 



25 



Ala Phe Asn Thr Gin Val Thr Ala Leu He Lys Leu Lys Glu Cvs 
1025 1030 1035 

Glu Asp Thr His Leu Gly Glu Glu Val Asp Arg Asn Trp Asn Glu 
1040 1045 1050 



35 



Val Val Thr Gin Gin Tyr Leu Phe Asp Arg Leu Ala His Glu He 
1055 1060 1065 

Glu Ala Leu Lys Ser Phe Ser Lys Ser Asp Leu Val Ser Trp Phe 
1070 1075 1080 



40 Lys Ala His Arg Gly Pro Gly Ser Lys Met Leu Ser Val His Val 
1085 1090 1095 

Val Gly Tyr Gly Lys Tyr Glu Leu Glu Glu Asp Gly Ala Pro Val 
1100 1105 1110 

Cys Glu Asp Pro Asn Ser Arg Glu Gly Met Gin Leu He Tyr Leu 
1115 1120 1125 

50 

Pro Pro Ser Pro Leu Leu Ala Glu Ser Thr Thr Pro He Thr Asp 
1130 1135 1140 



55 

He Arg Ala Phe Thr Ala Thr Leu Ser Leu Phe Pro Tyr His Lys 
1145 1150 1155 
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He Val Lys 
1160 

5 <210> 11 

<211> 1019 

<212> PRT 

<213> Homo sapiens 

10 <400> 11 

Met Arg Tyr Arg Leu Ala Trp Leu Leu His Pro Ala Leu Pro Ser Thr 
15 10 15 

Phe Arg Ser Val Leu Gly Ala Arg Leu Pro Pro Pro Glu Arg Leu Cys 
20 25 30 



15 



20 Gly Phe Gin Lys Lys Thr Tyr Ser Lys Met Asn Asn Pro Ala He Lys 
35 40 45 ^ 

25 In^ '^^'^ Glu Tyr 

oO 55 



_ ^, 60 
Arg Gly Leu Glu Leu Ala Asn Gly lie Lys Val Leu Leu Met Ser Asp 

70 75 80 



30 Pro Thr Thr Asp Lys Ser Ser Ala Ala Leu Asp Val His He Gly Ser 

85 90 95 

Leu Ser Asp Pro Pro Asn He Ala Gly Leu Ser His Phe Cys Glu His 
100 105 110 

Met Leu Phe Leu Gly Thr Lys Lys Tyr Pro Lys Glu Asn Glu Tyr Ser 
40 120 125 

Gin Phe Leu Ser Glu His Ala Gly Ser Ser Asn Ala Phe Thr Ser Gly 

130 135 y 



45 



Glu His Thr Asn Tyr Tyr Phe Asp Val Ser His Glu His Leu Glu Gly 

150 155 3L60 



50 Ala Leu Asp Arg Phe Ala Gin Phe Phe Leu Cys Pro Leu Phe Asp Glu 

165 170 175 

Ser Cys Lys Asp Arg Glu Val Asn Ala Val Asp Ser Glu His Glu Lys 
" 180 185 190 

Asn Val Met Asn Asp Ala Trp Arg Leu Phe Gin Leu Glu Lys Ala Thr 
195 200 205 
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Gly Asn Pro Lys His Pro Phe Ser Lys Phe Gly Thr Gly Asn Lys Tyr 
210 215 220 

5 

Thr Leu Glu Thr Arg Pro Asn Gin Glu Gly lie Asp Val Arg Gin Glu 
225 230 235 240 

10 

Leu Leu Lys Phe His Ser Ala Tyr Tyr Ser Ser Asn Leu Met Ala Val 
245 250 255 



15 Cys Val Leu Gly Arg Glu Ser Leu Asp Asp Leu Thr Asn Leu Val Val 
260 265 270 



Lys Leu Phe Ser Glu Val Glu Asn Lys Asn Val Pro Leu Pro Glu Phe 
20 275 280 285 



Pro Glu His Pro Phe Gin Glu Glu His Leu Lys Gin Leu Tyr Lys He 
290 295 300 

25 Val Pro He Lys Asp He Arg Asn Leu Tyr Val Thr Phe Pro He Pro 
305 310 315 320 



Asp Leu Gin Lys Tyr Tyr Lys Ser Asn Pro Gly His Tyr Leu Gly His 
30 325 330 335 



Leu He Gly His Glu Gly Pro Gly Ser Leu Leu Ser Glu Leu Lys Ser 
340 345 350 

35 

Lys Gly Trp Val Asn Thr Leu Val Gly Gly Gin Lys Glu Gly Ala Arg 
355 360 365 

40 

Gly Phe Met Phe Phe He He Asn Val Asp Leu Thr Glu Glu Gly Leu 
370 375 380 



45 Leu His Val Glu Asp He He Leu His Met Phe Gin Tyr He Gin Lys 
385 390 395 400 



Leu Arg Ala Glu Gly Pro Gin Glu Trp Val Phe Gin Glu Cys Lys Asp 
50 405 410 415 



Leu Asn Ala Val Ala Phe Arg Phe Lys Asp Lys Glu Arg Pro Arg Gly 
420 425 430 

55 



Tyr Thr Ser Lys He Ala Gly He Leu His Tyr Tyr Pro Leu Glu Glu 
435 440 445 
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Val Leu Thr Ala Glu Tyr Leu Leu Glu Glu Phe Arg Pro Asp Leu He 
450 455 450 

5 

Glu Met Val Leu Asp Lys Leu Arg Pro Glu Asn Val Arg Val Ala He 

470 475 480 

10 Val Ser Lys Ser Phe Glu Gly Lys Thr Asp Arg Thr Glu Glu Trp Tyr 

485 490 495 

Gly Thr Gin Tyr Lys Gin Glu Ala He Pro Asp Glu Val He Lys Lys 
15 500 505 510 

Trp Gin Asn Ala Asp Leu Asn Gly Lys Phe Lys Leu Pro Thr Lys Asn 
SIS 520 525 

Glu Phe He Pro Thr Asn Phe Glu He Leu Pro Leu Glu Lys Glu Ala 
530 535 540 

Thr Pro Tyr Pro Ala Leu He Lys Asp Thr Val Met Ser Lys Leu Trp 
25 545 550 555 560 

Phe Lys Gin Asp Asp Lys Lys Lys Lys Pro Lys Ala Cys Leu Asn Phe 
565 570 575 

30 

Glu Phe Phe Ser Pro Phe Ala Tyr Val Asp Pro Leu His Cys Asn Met 
580 585 590 

35 

Ala Tyr Leu Tyr Leu Glu Leu Leu Lys Asp Ser Leu Asn Glu Tyr Ala 
595 600 605 

40 Tyr Ala Ala Glu Leu Ala Gly Leu Ser Tyr Asp Leu Gin Asn Thr He 
610 615 620 

Tyr Gly Met Tyr Leu Ser Val Lys Gly Tyr Asn Asp Lys Gin Pro He 
45 625 630 635 640 

Leu Leu Lys Lys He He Glu Lys Met Ala Thr Phe Glu He Asp Glu 
5Q 645 650 655 

Lys Arg Phe Glu He He Lys Glu Ala Tyr Met Arg Ser Leu Asn Asn 
660 665 670 

55 

Phe Arg Ala Glu Gin Pro His Gin His Ala Met Tyr Tyr Leu Arg Leu 
675 680 685 
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Leu Met Thr Glu Val Ala Trp Thr Lys Asp Glu Leu Lys Glu Ala Leu 
690 695 700 

. 5 Asp Asp Val Thr Leu Pro Arg Leu Lys Ala Phe lie Pro Gin Leu Leu 
705 710 715 720 

Ser Arg Leu His lie Glu Ala Leu Leu His Gly Asn He Thr Lys Gin 
10 725 730 735 

Ala Ala Leu Gly He Met Gin Met Val Glu Asp Thr Leu He Glu His 
740 745 750 

15 

Ala His Thr Lys Pro Leu Leu Pro Ser Gin Leu Val Arg Tyr Arg Glu 
755 760 765 

20 

Val Gin Leu Pro Asp Arg Gly Trp Phe Val Tyr Gin Gin Arg Asn Glu 
770 775 780 

Val His Asn Asn Cys Gly He Glu He Tyr Tyr Gin Thr Asp Met Gin 
785 790 795 800 

25 

Ser Thr Ser Glu Asn Met Phe Leu Glu Leu Phe Cys Gin He He Ser 
805 810 815 

30 

Glu Pro Cys Phe Asn Thr Leu Arg Thr Lys Glu Gin Leu Gly Tyr He 
820 825 830 

35 Val Phe Ser Gly Pro Arg -Arg Ala Asn Gly He Gin Ser Leu Arg Phe 
835 840 845 

He He Gin Ser Glu Lys Pro Pro His Tyr Leu Glu Ser Arg Val Glu 
40 850 855 860 

Ala Phe Leu He Thr Met Glu Lys Ser He Glu Asp Met Thr Glu Glu 
865 870 875 880 

45 

Ala Phe Gin Lys His He Gin Ala Leu Ala He Arg Arg Leu Asp Lys 
885 890 895 

50 

Pro Lys Lys Leu Ser Ala Glu Cys Ala Lys Tyr Trp Gly Glu He He 
900 905 910 

55 Ser Gin Gin Tyr Asn Phe Asp Arg Asp Asn Thr Glu Val Ala Tyr Leu 
915 920 925 



Lys Thr Leu Thr Lys Glu Asp He He Lys Phe Tyr Lys Glu Met Leu 
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930 935 940 

Ala Val Asp Ala Pro Arg Arg His Lys Val Ser Val His Val Leu Ala 
5 945 950 955 960 

Arg Glu Met Asp Ser Cys Pro Val Val Gly Glu Phe Pro Cys Gin Asn 
965 970 975 

Asp He Asn Leu Ser Gin Ala Pro Ala Leu Pro Gin Pro Glu Val He 
980 985 990 

15 

Gin Asn Met Thr Glu Phe Lys Arg Gly Leu Pro Leu Phe Pro Leu Val 
995 1000 1005 

20 Lys Pro His He Asn Phe Met Ala Ala Lys Leu 
1010 1015 
<210> 12 
<211> 1265 
<212> PRT 
25 <213> Arabidopsis thaliana 

<400> 12 

Met Ala Ser Ser Ser Ser Ser He Phe Thr Gly Val Lys Phe Ser Pro 
30 1 5 10 15 

He Leu Ala Pro Phe Asn Ser Gly Asp Ser Axg Arg Ser Arg Tyr Leu 
35 20 25 30 

Lys Asp Ser Arg Asn Lys Val Arg Phe Asn Pro Ser Ser Pro Arg Leu 
35 40 45 



40 



Thr Pro His Arg Val Arg Val Glu Ala Pro Ser Leu He Pro Tvr Asn 
50 55 60 



45 Gly Leu Trp Ala Ala Gin Pro Asn Ser His Lys Gly Arg Leu Lys Arg 
^5 70 75 80 

Asn He Val Ser Gly Lys Glu Ala Thr Gly He Ser Leu Ser Gin Glv 
50 85 90 95 

Arg Asn Phe Cys Leu Thr Cys Lys Arg Asn Gin Ala Gly He Arg Arg 
5^ 105 110 



Ala Leu Pro Ser Ala Phe Val Asp Arg Thr Ala Phe Ser Leu Ser 
115 120 125 
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Ser Ser Leu Thr Ser Ser Leu Arg Lys His Ser Gin lie Val Asn Ala 
130 

5 

Thr Leu Gly Pro Asp Glu Pro His Ala Ala Gly Thr Ala Trp Pro Asp 

150 155 



160 



10 Gly He Val Ala Glu Arg Gin Asp Leu Asp Leu Leu Pro Pro Glu He 

165 170 175 

Asp Ser Ala Glu Leu Glu Ala Phe Leu Gly Cys Glu Leu Pro Ser His 
" 180 185 190 

Pro Lys Leu His Arg Gly Gin Leu Lys Asn Gly Leu Arg Tyr Leu He 
20 200 205 

Leu Pro Asn Lys Val Pro Pro Asn Arg Phe Glu Ala His Met Glu Val 

215 220 

25 His Val Gly Ser He Asp Glu Glu Glu Asp Glu Gin Gly He Ala His 



230 235 



240 



30 "'^'^ val Ala Phe Leu Gly Ser Lys Lys Arg Glu Lys Leu 

245 250 255 



35 



40 



Leu Gly Thr Gly Ala Arg Ser Asn Ala Tyr Thr Asp Phe His His Thr 
260 265 270 

Val Phe His He His Ser Pro Thr His Thr Lys Asp Ser Glu Asp Asp 
275 280 285 

Leu Phe Pro Ser Val Leu Asp Ala Leu Asn Glu He Ala Phe His Pro 
290 295 300 



45 Lys Phe Leu Ser Ser Arg Val Glu Lys Glu Arg Arg Ala He Leu Ser 
•^"^ 310 315 

Glu Leu Gin Met Met Asn Thr He Glu Tyr Arg Val Asp Cys Gin Leu 
325 330 335 



55 



Leu Gin His Leu His Ser Glu Asn Lys Leu Gly Arg Arg Phe Pro He 
340 345 

Gly Leu Glu Glu Gin He Lys Lys Trp Asp Val Asp Lys He Arg Lys 
3" 360 365 
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Phe His Glu Arg Trp Tyr Phe Pro Ala Asn Ala Thr Leu Tyr lie Val 
370 375 

5 

Gly Asp lie Asp Asn He Pro Arg He Val His Asn He Glu Ala Val 
385 390 395 4OO 

10 Phe Gly Lys Asn Gly Leu Asp Asn Glu Ser Thr Pro Ser Ser Pro Ser 

405 410 415 

Pro Gly Ala Phe Gly Ala Met Ala Asn Phe Leu Val Pro Lys Leu Pro 
15 420 425 430 

Ala Gly Leu Gly Gly Thr Phe Ser Asn Glu Lys Thr Asn Thr Ala Asp 
435 440 445 

20 Gin Ser Lys Met He Lys Arg Glu Arg His Ala He Arg Pro Pro Val 
450 455 4g0 



Glu His Asn Trp Ser Leu Pro Gly Thr Ser Val Asp Leu Lys Pro Pro 
25 465 470 



475 



480 



30 



Gin He Phe Lys His Glu Leu Leu Gin Asn Phe Ala He Asn Met Phe 
485 490 495 



35 



Cys Lys He Pro Val Ser Lys Val Gin Thr Phe Gly Asp Leu Arg Asn 
500 505 510 

Val Leu Met Lys Arg He Phe Leu Ser Ala Leu His Phe Arg He Asn 
515 520 525 



40 Thr Arg Tyr Lys Ser Ser Asn Pro Pro Phe Thr Ser Val Glu Leu Asp 
530 535 540 



His Ser Asp Ser Gly Arg Glu Gly Cys Thr Val Thr Thr Leu Thr Val 
45 545 550 555 



560 



50 



Thr Ala Glu Pro Gin Asn Trp Gin Asn Ala Val Lys Val Ala Val Gin 
565 570 575 



55 



Glu Val Arg Arg Leu Lys Glu Phe Gly Val Thr Arg Gly Glu Leu Thr 
580 585 590 

Arg Tyr Met Asp Ala Leu Leu Lys Asp Ser Glu His Leu Ala Ala Met 
595 600 605 



wo 2004/027067 ^^CT/DK2003/000609 

46/67 

lie Asp Asn Val Ser Ser Val Asp Asn Leu Asp Phe He Met Glu Ser 
"° 615 620 

5 Asp Ala Leu Ser His Thr Val Met Asp Gin Thr Gin Gly His Glu Thr 

630 

Leu val Ala Val Ala Gly Thr Val Thr Leu Glu Glu Val Asn Thr Val 

650 655 



Gly Ala Lys Val Leu Glu Phe He Ser Asp Phe Gly Arg Pro Thr Ala 

15 



665 670 



Leu Leu Pro Ala Ala He Val Ala Cys Val Pro Thr Lys Val His Val 

680 685 

20 ""^^69^^ ""^^ ^^""eSs^ ^%or '^^^ 



Glu Ser Val Lys Ser Gly Leu Leu Ala Pro He Glu Ala Glu Pro Glu 
25 ^ -715 720 

Leu Glu Val Pro Lys Glu Leu He Ser Gin Ser Gin Leu Lys Glu Leu 
725 730 735 

30 

Thr Leu Gin Arg Asn Pro Cys Phe Val Pro He Pro Gly Ser Gly Leu 
740 745 

35 Thr Lys Leu His Asp Lys Glu Thr Gly He Thr Gin Leu Arg Leu Ser 
755 760 765 

40 ^^"^ ?70 775 ^^"^ ^^"^ ^^"^ Ser Arg Ala 



45 



50 



Gly Val Met Arg Leu He Val Gly Gly Gly Arg Ala Ala Glu Thr Ser 

790 795 800 

Asp Ser Lys Gly Ala Val Val Val Gly Val Arg Thr Leu Ser Glu Gly 

810 815 

Gly Arg Val Gly Asp Phe Ser Arg Glu Gin Val Glu Leu Phe Cys Val 
820 825 830 



55 Asn His Leu He Asn Cys Ser Leu Glu Ser Thr Glu Glu Phe He Ala 
835 840 845 

Met Glu Phe Arg Phe Thr Leu Arg Asp Asn Gly Met Gin Ala Ala Phe 
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850 855 860 

Gin Leu Leu His Met Val Leu Glu Arg Ser Val Trp Leu Glu Asp Ala 
^ 870 875 880 

Phe Asp Arg Ala Arg Gin Leu Tyr Leu Ser Tyr Phe Arg Ser He Pro 
10 885 890 895 

Lys Ser Leu Glu Arg Ala Thr Ala His Lys Leu Met He Ala Met Leu 

900 905 910 

15 

Asn Gly Asp Glu Arg Phe Val Glu Pro Thr Pro Lys Ser Leu Gin Ser 

915 920 925 
Leu Asn Leu Glu Ser Val Lys Asp Ala Val Met Ser His Phe Val Glv 

2Q 935 94 0 



25 



Asp Asn Met Glu Val Ser He Val Gly Asp Phe Ser Glu Glu Glu He 

950 955 960 

Glu Arg Cys He Leu Asp Tyr Leu Gly Thr Val Lys Ala Ser His Asp 

965 970 975 



30 Ser Ala Lys Pro Pro Gly Ser Glu Pro He Leu Phe Arg Gin Pro Thr 
980 985 990 

Ala Gly Leu Gin Phe Gin Gin Val Phe Leu Lys Asp Thr Asp Glu Arg 
995 1000 1005 

Ala Cys Ala Tyr He Ala Gly Pro Ala Pro Asn Arg Trp Gly Phe 
1010 1015 1020 

Thr Val Asp Gly Asp Asp Leu Phe Gin Ser Val Ser Lys Leu Pro 
1025 1030 1035 



45 



Val Ala His Asp Gly Leu Leu Lys Ser Glu Glu Gin Leu Leu Glu 
1040 1045 1050 



50 Gly Gly Asp Arg Glu Leu Gin Lys Lys Leu Arg Ala His Pro Leu 

1055 1060 1065 

f!!^ ^^-^ ^^"^ l-®" l-e" Ala Glu He He Asn Ser 

=^ 1070 1075 1080 



Leu Phe Thr Thr Val Arg Asp Ser Leu Gly Leu Thr Tyr Asp 
1085 1090 1095 



wo 2004/027067 



48/67 



'CT/DK2003/000609 



??nn ^^"^ ^9 Leu Lys Leu Gly Trp 

5 " 1105 1110 

Tyr yal He Ser Val Thr Ser Thr Pro Gly Lys Val Tyr Lys Ala 

1115 1120 1125 

10 

Val Asp Ala Cys Lys Asn Val Leu Arg Gly Leu His Ser Asn Gin 

1130 1135 j^^^Q 

15 He Ala Pro Arg Glu Leu Asp Arg Ala Lys Arg Thr Leu Leu Met 

^^^^ 1150 1155 

^""^iJin '^^^ ^^"^ Ala Tyr Trp Leu Asn Leu 

1160 1165 1170 



20 



Leu Ala His Leu Gin Ala Ser Ser Val Gin Arg Lys Glu Leu Ser 
1175 1180 1185 



^^"^ ^^"^ Glu Ala Ala Ser He Glu 

1190 1195 ^200 




40 



^^-^ ^^"^ Gl" <51u Pro Glu Asp Val Phe Ser 

1235 1240 1245 

y^Jn ^""^ ""^^ Ser Met Thr Thr Arg Pro 

1250 1255 1260 



45 Thr Thr 
1265 



<210> 13 

50 <211> 534 

<212> PRT 

<213> Homo sapiens 



55 



<400> 13 



Met Arg Pro Asp Asp Lys Tyr His Glu Lys Gin Ala Gin Val Glu Ala 
5 



10 15 
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Thr Lys Leu Lys Gin Lys Val Glu Ala Leu Ser Pro Gly Asp Arg Gin 
20 25 30 

5 Gin He Tyr Glu Lys Gly Leu Glu Leu Arg Ser Gin Gin Ser Lys Pro 
35 40 45 

Gin Asp Ala Ser Cys Leu Pro Ala Leu Lys Val Ser Asp He Glu Pro 
^° 55 60 

Thr He Pro Val Thr Glu Leu Asp Val Val Leu Thr Ala Gly Asp He 
15 70 75 80 

Pro Val Gin Tyr Cys Ala Gin Pro Thr Asn Gly Met Val Tyr Phe Ara 
85 90 95 

20 Ala Phe Ser Ser Leu Asn Thr Leu Pro Glu Glu Leu Arg Pro Tyr Val 
100 105 HO 

Pro Leu Phe Cys Ser Val Leu Thr Lys Leu Gly Cys Gly Leu Leu Asp 
25 115 120 125 

Tyr Arg Glu Gin Ala Gin Gin He Glu Leu Lys Thr Gly Gly Met Ser 
130 135 

30 



Ala Ser Pro His Val Leu Pro Asp Asp Ser His Met Asp Thr Tyr Glu 

35 



150 155 160 



Gin Gly Val Leu Phe Ser Ser Leu Cys Leu Asp Arg Asn Leu Pro Asp 
165 170 175 

40 Met Met Gin Leu Trp Ser Glu He Phe Asn Asn Pro Cys Phe Glu Glu 
180 185 190 

Glu Glu His Phe Lys Val Leu Val Lys Met Thr Ala Gin Glu Leu Ala 
195 200 205 

Asn Gly He Pro Asp Ser Gly His Leu Tyr Ala Ser He Arg Ala Glv 
210 215 220 



55 



Arg Thr Leu Thr Pro Ala Gly Asp Leu Gin Glu Thr Phe Ser Gly Met 

230 235 240 

Asp Gin Val Arg Leu Met Lys Arg He Ala Glu Met Thr Asp He Lys 
245 250 255 
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Pro He Leu Arg Lys Leu Pro Arg He Lys Lys His Leu Leu Asn Glv 
260 265 270 

5 Asp Asn Met Arg Cys Ser Val Asn Ala Thr Pro Gin Gin Met Pro Gin 
275 280 285 

,o "^^"^ ^^"^ ^^"^ Arg Ser He Gly Arg Ser Lys 

10 290 295 300 

Lys Glu Arg Arg Pro Val Arg Pro His Thr Val Glu Lys Pro Val Pro 

310 315 320 

15 Ser Ser Ser Gly Gly Asp Ala His Val Pro His Gly Ser Gin Val He 

325 330 335 

Arg Lys Leu Val Met Glu Pro Thr Phe Lys Pro Trp Gin Met Lys Thr 
20 340 345 350 

His Phe Leu Met Pro Phe Pro Val Asn Tyr Val Gly Glu Cys He Ara 
25 360 365 

Thr Val Pro Tyr Thr Asp Pro Asp His Ala Ser Leu Lys He Leu Ala 
370 375 380 



30 



Arg Leu Met Thr Ala Lys Phe Leu His Thr Glu He Arg Glu Lys Glv 

390 395 4OO 



35 Gly Ala Tyr Gly Gly Gly Ala Lys Leu Ser His Asn Gly He Phe Thr 

405 410 415 

Leu Tyr Ser Tyr Arg Asp Pro Asn Thr He Glu Thr Leu Gin Ser Phe 
40 420 425 430 

Gly Lys Ala Val Asp Trp Ala Lys Ser Gly Lys Phe Thr Gin Gin Asp 
45 ^35 440 445 

He Asp Glu Ala Lys Leu Ser Val Phe Ser Thr Val Asp Ala Pro Val 
450 455 460 



50 



Ala Pro Ser Asp Lys Gly Met Asp His Phe Leu Tyr Gly Leu Ser Asp 

470 475 480 



55 Glu Met Lys Gin Ala His Arg Glu Gin Leu Phe Ala Val Ser His Asp 

485 490 495 



Lys Leu Leu Ala Val Ser Asp Arg Tyr Leu Gly Thr Gly Lys Ser Thr 
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500 505 510 

His Gly Leu Ala He Leu Gly Pro Glu Asn Pro Lys He Ala Lys Asp 
5 515 520 525 

Pro Ser Trp He He Arg 
530 

10 



<210> 14 
<211> 409 
15 <212> PRT 

<213> Bacillus siobtilis 

<400> 14 

20 Met He Lys Arg Tyr Thr Cys Pro Asn Gly Val Arg He Val Leu Glu 
15 10 15 

Asn Asn Pro Thr Val Arg Ser Val Ala He Gly Val Trp He Gly Thr 
25 20 25 30 

Gly Ser Arg His Glu Thr Pro Glu He Asn Gly He Ser His Phe Leu 
3Q 35 40 45 

Glu His Met Phe Phe Lys Gly Thr Ser Thr Lys Ser Ala Arg Glu He 
50 55 60 

35 

Ala Glu Ser Phe Asp Arg He Gly Gly Gin Val Asn Ala Phe Thr Ser 

70 75 80 

40 Lys Glu Tyr Thr Cys Tyr Tyr Ala Lys Val Leu Asp Glu His Ala Asn 

85 90 95 

Tyr Ala Leu Asp Val Leu Ala Asp Met Phe Phe His Ser Thr Phe Asp 
100 105 110 

Glu Asn Glu Leu Lys Lys Glu Lys Asn Val Val Tyr Glu Glu He Lvs 
50 115 120 125 

Met Tyr Glu Asp Ala Pro Asp Asp He Val His Asp Leu Leu Ser Lys 
130 135 



55 



Ala Thr Tyr Gly Asn His Ser Leu Gly Tyr Pro He Leu Gly Thr Glu 

150 155 160 
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Glu Thr Leu Ala Ser Phe Asn Gly Asp Ser Leu Arg Gin Tyr Met His 

165 170 175 

5 Asp Tyr Tyr Thr Pro Asp Arg Val Val He Ser Val Ala Gly Asn He 
180 185 190 

Ser Asp Ser Phe He Lys Asp Val Glu Lys Trp Phe Gly Ser Tyr Glu 
10 195 200 205 

Ala Lys Gly Lys Ala Thr Gly Leu Glu Lys Pro Glu Phe His Thr Glu 
210 215 220 



15 



Lys Leu Thr Arg Lys Lys Glu Thr Glu Gin Ala His Leu Cys Leu Glv 
225 230 235 240 



20 Phe Lys Gly Leu Glu Val Gly His Glu Arg He Tyr Asp Leu He Val 

245 250 255 

Leu Asn Asn Val Leu Gly Gly Ser Met Ser Ser Arg Leu Phe Gin Asp 
25 260 265 270 

Val Arg Glu Asp Lys Gly Leu Ala Tyr Ser Val Tyr Ser Tyr His Ser 
275 280 285 

30 

Ser Tyr Glu Asp Ser Gly Met Leu Thr He Tyr Gly Gly Thr Gly Ala 
290 295 300 

35 

Asn Gin Leu Gin Gin Leu Ser Glu Thr He Gin Glu Thr Leu Ala Thr 
305 310 315 320 

40 Leu Lys Arg Asp Gly He Thr Ser Lys Glu Leu Glu Asn Ser Lys Glu 

325 330 335 

Gin Met Lys Gly Ser Leu Met Leu Ser Leu Glu Ser Thr Asn Ser Lys 
45 340 345 350 

Met Ser Arg Asn Gly Lys Asn Glu Leu Leu Leu Gly Lys His Lys Thr 
355 360 365 

50 

Leu Asp Glu He He Asn Glu Leu Asn Ala Val Asn Leu Glu Arg Val 
370 375 380 

55 

Asn Gly Leu Ala Arg Gin Leu Phe Thr Glu Asp Tyr Ala Leu Ala Leu 
385 390 395 400 
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lie Ser Pro Ser Gly Asn Met Pro Ser 
405 

5 <210> 15 
<211> 438 
<212> PRT 

<213> Mycobacterium tuberculosis 
10 <400> 15 

Met Pro Arg Arg Ser Pro Ala Asp Pro Ala Ala Ala Leu Ala Pro Arq 
1 5 10 15 

15 Arg Thr Thr Leu Pro Gly Gly Leu Arg Val Val Thr Glu Phe Leu Pro 
20 25 30 

Ala Val His Ser Ala Ser Val Gly Val Trp Val Gly Val Gly Ser Arg 
20 35 40 45 

Asp Glu Gly Ala Thr Val Ala Gly Ala Ala His Phe Leu Glu His Leu 
50 55 60 

25 

Leu Phe Lys Ser Thr Pro Thr Arg Ser Ala Val Asp lie Ala Gin Ala 
65 70 75 80 

30 

Met Asp Ala Val Gly Gly Glu Leu Asn Ala Phe Thr Ala Lys Glu His 
85 90 95 

35 Thr Cys Tyr Tyr Ala His Val Leu Gly Ser Asp Leu Pro Leu Ala Val 
100 105 110 

Asp Leu Val Ala Asp Val Val Leu Asn Gly Arg Cys Ala Ala Asp Asp 
40 115 120 125 

Val Glu Val Glu Arg Asp Val Val Leu Glu Glu lie Ala Met Arg Asp 
130 135 140 

45 

Asp Asp Pro Glu Asp Ala Leu Ala Asp Met Phe Leu Ala Ala Leu Phe 
145 150 155 160 

50 

Gly Asp His Pro Val Gly Arg Pro Val lie Gly Ser Ala Gin Ser Val 
165 170 175 

55 Ser Val Met Thr Arg Ala Gin Leu Gin Ser Phe His Leu Arg Arg Tyr 
180 185 



Thr Pro Glu Arg Met Val Val Ala Ala Ala Gly Asn Val Asp His Asp 
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195 200 205 

Gly Leu Val Ala Leu Val Arg Glu His Phe Gly Ser Arg Leu Val Arg 
5 210 215 220 

Gly Arg Arg Pro Val Ala Pro Arg Lys Gly Thr Gly Arg Val Asn Gly 
225 230 235 240 

10 Ser Pro Arg Leu Thr Leu Val Ser Arg Asp Ala Glu Gin Thr His Val 

245 250 255 

Ser Leu Gly He Arg Thr Pro Gly Arg Gly Trp Glu His Arg Trp Ala 
15 260 265 270 

Leu Ser Val Leu His Thr Ala Leu Gly Gly Gly Leu Ser Ser Arg Leu 
275 280 285 

20 

Phe Gin Glu Val Arg Glu Thr Arg Gly Leu Ala Tyr Ser Val Tyr Ser 
290 295 . 300 

25 

Ala Leu Asp Leu Phe Ala Asp Ser Gly Ala Leu Ser Val Tyr Ala Ala 
305 310 315 320 

30 Cys Leu Pro Glu Arg Phe Ala Asp Val Met Arg Val Thr Ala Asp Val 

325 330 335 

Leu Glu Ser Val Ala Arg Asp Gly He Thr Glu Ala Glu Cys Gly He 
35 340 345 350 

Ala Lys Gly Ser Leu Arg Gly Gly Leu Val Leu Gly Leu Glu Asp Ser 
355 360 365 

40 

Ser Ser Arg Met Ser Arg Leu Gly Arg Ser Glu Leu Asn Tyr Gly Lvs 
370 375 380 

45 

His Arg Ser He Glu His Thr Leu Arg Gin He Glu Gin Val Thr Val 
385 390 395 400 

50 Glu Glu Val Asn Ala Val Ala Arg His Leu Leu Ser Arg Arg Tyr Gly 

405 410 415 

Ala Ala Val Leu Gly Pro His Gly Ser Lys Arg Ser Leu Pro Gin Gin 
55 420 425 430 



Leu Arg Ala Met Val Gly 
435 
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<210> 16 

<211> 34 

5 <212> DNA 

<213> Artificial 



Sequence 



<220> 

10 <223> Oligonucleotide 
<400> 16 

aatagaagct tgtcgactga tctatccaaa actg 



15 



20 



34 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



17 
66 
DNA 

Artificial Sequence 



Oligonucleotide 



<400> 17 

25 aaaagagctc ggccagatct tctagaggat ccaagaattc tgttttatat ttgttgtaaa 
aagtag 



60 
66 



30 <210> 18 

<211> 37 

<212> DNA 

<213> Artificial Sequence 



35 



<220> 

<223> Oligonucleotide 



<400> 18 

40 ttttgaattc caagatctcc catgtctcta ctggtgg 



37 



<210> 19 

<211> 41 

45 <212> DNA 

<213> Artificial Sequence 



<220> 

50 <223> Oligonucleotide 

<400> 19 

ccccgagctc gtcgaccctt ctcgaaagct ttaacgaacg c 41 

55 

<210> 20 

<211> 43 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Oligonucleotide 
5 <400> 20 

ttttgaattc aaagaatgag atttccttca atttttactg cag 

<210> 21 
<211> 37 
10 <212> DNA 

<213> Artificial Sequence 



15 



40 



45 



43 



<220> 

<223> Oligonucleotide 
<400> 21 

tttttctaga ctaggagggg tactcatact cctcggc 



37 



<210> 22 

20 <211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

25 <223> Oligonucleotide 

<400> 22 

cgaatgtcca tcgttgcgaa cctgcagaac ctg 

30 <210> 23 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> Oligonucleotide 



33 



<400> 23 

caggttctgc aggttcctaa cgatggacat teg 

<210> 24 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



33 



<400> 24 

50 cgaatgtcca tcgttaggaa cctgcagaac ctg 



33 



<210> 25 

<211> 33 

55 <212> DNA 

<213> Artificial Sequence 



<220> 
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<223> Oligonucleotide 
<400> 25 

caggttctgc aggttcctaa cgatggacat teg 33 
5 <210> 26 
<211> 18 
<212> DNA 

<213> Artificial Sequence 

10 <220> 

<223> Oligonucleotide 

<400> 26 

tcgcagagaa cggatggc 18 

15 

<210> 27 

<211> 36 

<212> DNA 

20 <213> Artificial Sequence 

<220> 

<223> Oligonucleotide 
25 <400> 27 

ttttgggccc ttcatggtga tacggtatct cttggc 36 



<210> 28 

30 <211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Oligonucleotide 



40 



45 



<400> 28 

ttttctcgag aaggtggaac atactgccct gggatgg 



<210> 29 

<211> 38 

<212> DNA 

<213> Artificial Sequence 



37 



<220> 

<223> Oligonucleotide 
<400> 29 

50 ttttgagctc gtttaggaaa cgtccttggc ggagatgc 



38 



<210> 30 

<211> 40 

55 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide 
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<400> 30 

tttttctaga cactgcgaat ccatggtata aaccaaaacc 

<210> 31 
<211> 24 
<212> DNA 

<213> Artificial Sequence 



40 



<220> 

10 <223> Oligonucleotide 



15 



20 



<400> 31 

gtcgttgttc atggacatac ctcc 



<210> 32 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 



24 



<400> 32 
25 tacaaatgtt cttctgccat ttctgg 



26 



<210> 33 

<211> 32 

30 <212> DNA 

<213> Artificial Sequence 



35 



<220> 

<223> Oligonucleotide 
<400> 33 

ggttcatatg cgccggagct cctcgacagc ag 32 



40 <210> 34 

<211> 37 

<212> DNA 

<213> Artificial Sequence 

45 <220> 

<223> Oligonucleotide 



50 



<400> 34 

ggttcctagg atccgcaagt ttgattccat tgcggtg 



37 



<210> 35 

<211> 70 

<212> DNA 

55 <213> Artificial Sequence 



<220> 

<223> Oligonucleotide 



10 
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<400> 35 

ttaaagagta ccttggctat agaataccgt agagataaag acctgaatag agattgtact 60 
gagagtgcac 70 

5 

<210> 36 
<211> 70 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide 



<400> 36 

15 aggtattata actatttttc tgtatttttt atatattttt atttgccaag ctgtgcggta 60 
tttcacaccg 70 



20 <210> 37 

<211> 23 

<212> DNA 

<213> Artificial Sequence 

25 <220> 

<223> Oligonucleotide 

<400> 37 

ctttggttaa agagtacctt ggc 23 

30 

<210> 38 

<211> 23 

<212> DNA 

35 <213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

40 <400> 38 

tactacgaaa agcgtgtgcg agg 23 



<210> 39 

45 <211> 71 

<212> DNA 

<213> Artificial Sequence 

<220> 

50 <223> Oligonucleotide 

<400> 39 

tagaaggcta ctcaaaagaa taaagttact ataaaatata ctgcggtata tagattgtac 60 



55 tgagagtgca c 



71 
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<210> 40 
<211> 70 

<212> DNA 

<213> Artificial Sequence 

5 

<220> 

<223> Oligonucleotide 
<400> 40 

10 gatcggcaag aaactttgaa gcagtatatt tacaggatta aattatatat ctgtgcggta 60 
tttcacaccg -jq 



15 <210> 41 

<211> 22 

. <212> DNA 

<213> Artificial Sequence 

20 <220> 

<223> Oligonucleotide 

<400> 41 

cggaggggct ctatgataaa gg 

25 



<210> 42 

<211> 23 

<212> DNA 

30 <213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

35 <400> 42 

gagtaactag ggcttctctt ccc 



22 



23 



<210> 43 

40 <211> 85 

<212> PRT 

<213> Homo sapiens 



<400> 43 

45 

Ser Gly Leu Gin Arg Ala Glu Glu Ala Pro Arg Arg Gin Leu Arg Val 
15 10 15 



50 Ser Gin Arg Thr Asp Gly Glu Ser Arg Ala His Leu Gly Ala Leu Leu 
20 25 30 



Ala Arg Tyr lie Gin Gin Ala Arg Lys Ala Pro Ser Gly Arg Met Ser 
55 35 40 45 



lie Val Lys Asn Leu Gin Asn Leu Asp Pro Ser His Arg lie Ser Asp 
50 55 60 
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Arg Asp Tyr Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu 
65 70 75 80 

5 

Tyr Glu Tyr Pro Ser 
85 

10 

<210> 44 

<211> 22 

<212> PRT 

<213> Homo sapiens 

15 

<400> 4 4 

Gin Leu Arg Val Ser Gin Arg Thr Asp Gly Glu Ser Arg Ala His Leu 
20 1 5 10 15 

Gly Ala Leu Leu Ala Arg 
20 

25 

<210> 45 
<211> 19 
<212> PRT 
30 <213> Homo sapiens 

<400> 45 

35 Val Ser Gin Arg Thr Asp Gly Glu Ser Arg Ala His Leu Gly Ala Leu 
15 10 15 

Leu Ala Arg 

40 



<210> 46 

<211> 51 

45 <212> PRT 

<213> Homo sapiens 

<400> 46 

50 

Tyr lie Gin Gin Ala Arg Lys Ala Pro Ser Gly Arg Met Ser He Val 
15 10 15 

55 Lys Asn Leu Gin Asn Leu Asp Pro Ser His Arg He Ser Asp Arg Asp 
20 25 30 

Tyr Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu Tyr Glu 
35 40 45 
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Tyr Pro Ser 
50 

5 <210> 47 
<211> 17 
<212> PRT 
<213> Homo sapiens 



10 



15 



<400> 47 

Tyr lie Gin Gin Ala Arg Lys Ala Pro Ser Gly Arg Met Ser lie Val 
15 10 15 

Lys 



20 <210> 48 

<211> 16 

<212> PRT 

<213> Homo sapiens 

25 <400> 48 

Tyr lie Gin Gin Ala Arg Lys Ala Pro Ser Gly Arg Met Ser He Val 
15 10 15 

30 <210> 49 

<211> 13 

<212> PRT 

<213> Homo sapiens 



35 



40 



<400> 49 

Asn Leu Gin Asn Leu Asp Pro Ser His Arg He Ser Asp 
15 10 



<210> 50 

<211> 23 

<212> PRT 

45 <213> Homo sapiens 



<400> 50 

50 Asn Leu Gin Asn Leu Asp Pro Ser His Arg He Ser Asp Arg Asp Tyr 
15 10 15 



Met Gly Trp Met Asp Phe Gly 
55 20 
<210> 51 
<211> 34 
<212> PRT 
<213> Homo sapiens 



wo 2004/027067 ^^PCT/DK2003/000609 

63/67 



<400> 51 

Asn Leu Gin Asn Leu Asp Pro Ser His Arg He Ser Asp Arg Asp Tyr 
5 1 5 10 15 

Met Gly Trp Met Asp Phe Gly Arg Arg Ser Ala Glu Glu Tyr Glu Tyr 
20 25 30 

10 

Pro Ser 



15 

<210> 53 
- - <211> 12- ----- 

<212> PRT 

<213> Artificial Sequence 

20 

<220> 

<223> Portion of fusion protein 



<400> 53 

25 

Lys Arg Glu Ala Glu Ala Ser Gly Leu Gin Arg Ala 
15 10 



30 


<210> 


54 






<211> 


9 






<212> 


PRT 






<213> 


Homo 


sapiens 


35 










<400> 


54 






Arg Met Ser 


He Val 




1 




5 


40 










<210> 


55 






<211> 


13 






<212> 


PRT 




45 


<213> 


Homo 


sapiens 




<400> 


55 





Asp Arg Asp Tyr Met Gly Trp Met Asp Phe Gly Arg Arg 
50 1 5 10 



<210> 56 
<211> 48 
55 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide MFalBNP (S) 
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5 



<400> 56 

ggataaaaga gaggctgaag ctcacccgct gggcagcccc ggttcagc 



48 



<210> 57 
<211> 48 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223> MFlaBNP (AS) 
<400> 57 

15 gctgaaccgg ggctgcccag cgggtgagct tcagcctctc ttttatcc 4 8 

<210> 58 
<211> 34 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Oligonucleotide BNPS'EcoRI 



<210> 59 
30 <211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

35 <223> Oligonucleotide BNP3'Xbal 
<400> 59 

tttttctaga ttaatgccgc ctcagcactt tgc 33 



25 



<400> 58 

ttttgaattc atggatcccc agacagcacc ttcc 



34 



40 



<210> 60 
<211> 48 
<212> DNA 



<213> Artificial Sequence 



45 



<220> 

<223> Oligonucleotide MFIBNP (S) 



50 <400> 60 

ggataaaaga gaggctgaag ctcacccgct gggcagcccc ggttcagc 



48 



<210> 61 
<211> 48 



55 <212> DNA 

<213> Artificial Sequence 



<220> 

<223> Oligonucleotide MFIBNP (AS) 
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<400> 61 

gctgaaccgg ggctgcccag cgggtgagct tcagcctctc ttttatcc 48 

5 

<210> 62 
<211> 69 
<212> DNA 

<213> Artificial Sequence 

10 

<220> 

<223> Oligonucleotide YPS15'GD400 
<400> 62 

15 aaaaagataa ggtgaacacc aagcatatag tataatatta cctaccacat gattgtactg 60 
agagtgcac 69 

<210> 63 
<211> 70 
20 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide YPS13 '00400 

25 

<400> 63 

aactccaact ggcttggaga tgtgaatgtc taaactttgt gcaacggttt ctgtgcggta 60 
tttcacaccg 70 



30 <210> 64 
<211> 20 
<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> Oligonucleotide YPSIS'DC 

<400> 64 

tcgtttcact gatgtgtccg 

40 

<210> 65 
<211> 21 
<212> DNA 

<213> Artificial Sequence 

45 

<220> 

<223> Oligonucleotide YPSIS'DC 

<400> 65 
50 gattataggc catatcccag g 

<210> 66 
<211> 13 
<212> PRT 
55 <213> Artificial Sequence 



20 



21 



<220> 

<223> Pitrilysin consensus sequence 



wo 2004/027067 



66/67 



IPCT/DK2003/000609 



<221> VARIANT 

<222> (1) , . . (13) 

<223>'Xaa = Any Amino Acid 

5 

<221> VARIANT 

<222> (1) . . . (13) 

<223> Xaa = Any Amino Acid 

10 <221> VARIANT 

<222> (1) . . . (13) 

<223> Xaa = Any Amino Acid 

<221> UNSURE 
15 <222> (0) . . . (0) 

<400> 11 

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly 
15 10 

20 

<210> 67 
<211> 44 
<212> PRT 
25 <213> Artificial Sequence 

<220> 

<223> Pitrilysin consensus sequence 

30 

<221> VARIANT 

<222> (1) . . . (44) 

<223> Xaa = Any Amino Acid 

35 <221> VARIANT 

<222> (1) ... (44) 

<223> Xaa = Any Amino Acid 

<221> VARIANT 
40 <222> (1) . . . (44) 

<223> Xaa = Any Amino Acid 

<221> VARIANT 
<222> (34) . . . (34) 
45 <223> Xaa = any amino acid or absent 

<221> VARIANT 
<222> (35) . . . (35) 

<223> Xaa = any amino acid or absent 

50 

<400> 12 

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Xaa 

15 10 15 

Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
55 20 25 30 

Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr 
35 40 
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<210> 68 

<211> 44 

<212> PRT 

<213> Artificial Sequence 

5 

<220> 

<223> Pitrilysin consensus sequence 



10 <221> VARIANT 

<222> (1) . . . (44) 

<223> Xaa = Any Amino Acid 

<221> VARIANT 
15 <222> (1) . . . (44) 

<223> Xaa = Any Amino Acid 

<221> VARIANT 
<222> (1) . . . (44) 
20 <223> Xaa Any Amino Acid 

<221> VARIANT 
<222> (34) ... (34) 

<223> Xaa - any amino acid or absent 

25 

<221> VARIANT 
<222> (35) . . . (35) 

<223> Xaa — any amino acid or absent 
30 <400> 13 

Gly Xaa Xaa His Xaa Xaa Glu His Xaa Xaa Xaa Xaa Gly Xaa Xaa Lys 

15 10 15 

Tyr Pro Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Leu Xaa Xaa Xaa Xaa Xaa 
20 25 30 

35 Xaa Xaa Xaa Asn Ala Xaa Thr Xaa Xaa Xaa Xaa Thr 
35 40 



